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EDITORIAL 


Assessing Classroom Achievement 


John O. Anderson 


University of Victoria 


There is increased interest in the relationship between educational measure- 
ment and the classroom practices used in evaluating student achievement 
(Stiggins, 1988). Questions have been raised as to the utility of procedures 
developed through educational measurement for teaching and learning. 

Although the testing of student achievement is common practice in edu- 
cational settings, only recently have researchers begun to study classroom 
testing in distinction from psychometrics. Educational measurement 
specialists have attempted to bridge the gap between psychometrics and the 
measurement of educational achievement in the classroom. The educational 
measurement literature addresses issues such as test reliability, validity, 
standard error of measurement, item bias, test equating, models underlying 
response data, procedures for test design and construction, item analysis, 
test scoring and analysis, and standard setting. But this work is based 
primarily on large-scale test programs in which the test is administered, 
analyzed, and interpreted as a single, isolated entity such as a final examina- 
tion, a system-wide assessment, or a commercially developed standardized 
test. In contrast, the assessment of student achievement by teachers in 
classrooms is characterized by multiple testings or data collections through- 
out an instructional period using an eclectic array of information generating, 
collecting, and collating procedures (Anderson, 1989). 

This special issue of The Alberta Journal of Educational Research con- 
tains papers presented to the Conference on Classroom Testing held at the 
University of Victoria in February 1989. The focus of the conference was 
classroom testing: What testing is going on in classrooms? What is the 
climate of educational testing? What could be going on, and what should be 
going on in classrooms in regard to testing the educational achievement of 
students? The intent of the conference was to bring researchers together to 
consolidate some of the major elements of this emergent field and articulate 
directions for future research initiatives. The conference was jointly sup- 
ported by funding from the Social Sciences and Humanities Research Council 
of Canada and the British Columbia Ministry of Education. 

The participants are active researchers in the field of classroom testing 
who are making significant contributions to the development of classroom 
testing as a distinct measurement activity and research field. The papers are 
organized into the following sections. 
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Current Research on Classroom Testing 


Robert J. Wilson of Queen’s University reported the findings to date of the 
Ecology of Evaluation: Evaluation within Educational Settings, a project 
examining the student evaluation practices of classroom teachers. The study 
involves a number of schools throughout Canada in an intensive descriptive 
study of the methods and instruments used in the evaluation of student 
achievement. Philip Nagy of the Ontario Institute for Studies in Education 
and Roslyn Moorhead of The University of Western Ontario reported on the 
results of a systematic analysis of administrator response to the results of an 
investigation into classroom testing practices. The study analyzed the prob- 
lem solving behavior of school principals in regard to testing situations in a 
school setting. Further, administrator responses to discrepancies in class- 
room testing practices were also reported. Clement Dassa of the Université 
de Montréal reported on the project Elaboration d’un modeéle d’integration 
du diagnostic pédagogique aux apprentissages scolaires. The project is being 
conducted on site in a school district in Quebec by a team of researchers at 
the Université de Montréal. David Bateson of The University of British 
Columbia reported on the testing and grading practices of British Columbia 
science teachers. The data were derived from a survey of approximately 1000 
grade 4, 7, and 10 science teachers throughout British Columbia. The major 
issues addressed were the sources of grading information used in evaluating 
student performance, the connection of grading information to curricular 
emphasis, the emphasis placed on different types of measurement procedures 
and instruments, and the patterns of distributions of grades awarded to 
students. 


Current Educational Climate in Relation to Testing 


Todd Rogers of the University of Alberta described and discussed the find- 
ings of three major investigations into perceptions of public education by 
students, teachers, parents, trustees, ministry officials, and politicians. Par- 
ticular attention was paid to those facets with relevance to student evalua- 
tion. The investigations involved province-wide surveys of public perceptions 
of the provincial educational system: Let’s Talk About Schools (1984, a 
British Columbia survey of public attitudes toward the provincial educational 
system) and The Royal Commission on Education (1988). Rogers was a key 
investigator in both of these studies. 


What Could Be Going On: Available Technologies 
and Procedures for Classroom Testing 


Dan Bachor of the University of Victoria described a set of procedures which 
use students’ daily classroom performance as the basis of assessment. These 
procedures allow the regular classroom teacher to determine if a concept has 
been learned. They also provide the special education teacher with a techni- 
cally adequate sample of classroom performance that can be translated 
directly into instructional objectives. Leslie McLean of the Ontario Institute 
for Studies in Education developed the theme that specialists in educational 
measurement have given too much attention to the technical aspects of 
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measurement (e.g., reliability, item analysis, item response models) and too 
little attention to the educational aspects (e.g., motivation, contribution to 
teaching and learning). A broader concept of assessment of achievement is 
advocated, building on work now well advanced in the United Kingdom 
under the heading “Portfolios” or “Records of achievement.” The record of 
achievement provides a basis for grading while enhancing motivation and 
documenting attainment comprehensively for parents and employers. Tests 
can be part of the record, but they play very much a supporting role. Prob- 
lems of standards and comparability are addressed. 


Consolidation and Research Directions 


Ross Traub of the Ontario Institute for Studies in Education and Richard 
Stiggins of the Northwest Regional Educational Laboratory independently 
provided analyses and syntheses of the papers presented at this conference 
with a view to identifying researchable problems in classroom testing and 
student evaluation, and description and discussion of emergent research 
trends. 
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ROBERT J. WILSON 


Queen’s University 


Classroom Processes in Evaluating 
Student Achievement 


The purpose of this study was to determine whether viewing classroom processes in evaluat- 
ing student achievement from a decision making model would account for the observed 
practices. Forty-nine classrooms participated in the study. The classrooms ranged from 
Kindergarten to Grade 13 and were situated in Ontario and British Columbia. Teachers 
collected all the instruments they used with one class for one subject over the course of a 
reporting period. They also completed a checklist for each instrument and a log for the 
entire process. In addition, an interview was held with each teacher at the end of the 
reporting period. The results tended to show that teachers beyond the early primary grades 
use their evaluations mainly to generate marks for reporting purposes. These reports 
typically reaffirm judgments about achievement rather than develop them, with the result 
that very little alteration of instruction is made as a result of the evaluation activity. 


One approach to describing classroom practices in evaluating students could 
be to determine the degree to which these practices match those engaged in 
or approved of by professional evaluators. If a gap is found, then ameliorative 
action springs fairly readily from the analysis. This is the model underlying 
Frisbie and Friedman’s (1987) suggestions concerning the applicability of the 
Test Standards as a model for preservice instruction and was also the as- 
sumption underlying Mayo’s (1964) paper in the inaugural issue of the 
Journal of Educational Measurement. 

What researchers like Gullickson (1984) and Stiggins and Bridgeford 
(1985) have been discovering in their analyses of classroom practice is that 
the world of the classroom teacher seems so different from that of the 
professional evaluator that the gap, far from being narrowed by decades of 
preservice and inservice instruction, is as wide as ever. 

One possible way to reconcile these approaches would be to see both 
enterprises as exercises in informed decision making. A model based on this 
construct would see both classroom-based evaluators and professional test- 
makers as engaging in the activities of helping to choose among possible 
courses of action through the provision of timely information to inform 
judgment. 

Such a model is worked out in more detail in Wilson and Rees (in press) 
but will be described briefly here. It assumes that evaluation can be conceived 
of as three interrelated processes: measurement, judgment, and decision 
making. Measurement is defined, traditionally, as the assigning of numbers 
to events according to rules. A watch is a measuring instrument because the 
rules for defining units, for calibration, and so on are known and accepted 
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widely. Judgment is defined as the conceptual act of matching the informa- 
tion gleaned from the measurement with a set of expectations, a test of fit. 
Thus we check the watch, not primarily to determine the time, but to 
compare the information it gives us to various mental benchmarks. If the 
information is that it is earlier than 10:00, then we refrain from going for 
coffee, but if it is after 10:00, we go. These benchmarks constitute decision 
points. Thus the third aspect of the model, decision making, requires a 
selection of a course or courses of action, derived from the measurement- 
bound judgments that activate the decision. 

The criteria for these elements in the decision making system emerge 
fairly directly from the overall purpose. The goal of measurement is to 
provide accurate information under certain prescribed conditions, and the 
criterion for accuracy of measurement is reliability. Much effort is expended 
in professional test development to enhance the degree of reliability instru- 
ments might possess in various situations. Judgments, however, take the 
issue partially out of the technical realm by introducing the concept of use. 
The degree to which an instrument’s results can help in forming adequate 
judgments about a situation is a measure of the instrument’s validity. As 
various authorities remind us, validity is in the hands of the user; although 
the test maker may anticipate and recommend certain appropriate uses of a 
test, the client is the ultimate arbiter of validity. Finally, decisions made as a 
result of the testing and its interpretation may be seen as helpful, harmful, or 
neutral. The criterion of utility can be invoked here as relevant for determin- 
ing whether the courses of action followed as a result of the measurement 
activity constitute a net gain on those that might have been followed anyway 
or those that might have been followed with a different measure. 

The application of this approach to the work of the professional evaluator 
can be quite linear and straightforward. Most test makers define a small 
number of discrete purposes for an instrument, work at creating an accept- 
able level of objectivity for it, and provide evidence that judgments which 
flow from an adequate interpretation of certain patterns of results will be 
better than would be the case without the measure. Because the professional 
has control over the measurement, most attention will be paid to its charac- 
teristics. As the test moves into its judgment and decision making phases, 
into use, the ability of the professional to ensure validity and utility shrink 
although responsibility for contributing to them does not. The appropri- 
ateness of the decision making approach to classroom evaluation, however, is 
less direct. Singleness of purpose, so helpful in aiding in the technical sophis- 
tication of the professional, is usually absent in the classroom. The teacher as 
evaluator serves many levels in a system (students, department heads, prin- 
cipals, superintendents, and Ministry officials) as well as some outside the 
formal system (parents, colleges and universities, employers). All these 
groups have insistent and frequently contradictory uses for the evaluation 
information a teacher may generate. Consequently, the tidiness of aim, so 
necessary for the development of objective measurement, is replaced with a 
variety of purposes that renders the teacher’s decision making process far 
more complex (Wilson, 1989; Rees, 1989). 


R.J. Wilson 


Purpose 


The research to be reported here (which is part of a larger program conducted 
by Wilson and Rees) had as its overriding aim the charting of the policy and 
procedural frameworks for decision making in which teachers and others in 
the levels described above make their measures and judgments. As part of the 
analysis of that system, the classroom level and its policies and procedures 
constituted an important element. In this report, the classroom processes in 
evaluation are the focus, and only passing references are made to the other 
aspects of the investigation. The question this paper addresses is: To what 
extent does a decision making model account for the practice of student 
evaluation as designed and conducted by teachers in classrooms? 


Method 


Design 

Because of the overall purpose of the research, a multisite, nonrandom 
sample of classrooms was selected for this study. Two provinces, British 
Columbia and Ontario, constituted the primary sampling unit. These provin- 
ces differ markedly in the way they create, disseminate, and manage educa- 
tional policy, with British Columbia operating from a more top-down 
approach and Ontario from a more differentiated policy framework. Two 
districts in British Columbia and two boards in Ontario constituted the next 
level with variation in student enrollment between the districts and boards 
being the primary selection variable. In each of these units, two elementary 
and two secondary schools were selected. In each elementary school, one 
primary class and one junior (Ontario) or intermediate (British Columbia) 
class was selected. In each secondary school, four classes representing a 
variety of subject matters and departments were selected. 

As a result of this process, which included the opportunity for self-selec- 
tion at the school level, a final sample of 49 classrooms was obtained. The 
obtained sample contained two anomalies. Two classes in a secondary school 
in British Columbia produced inadequate data for inclusion, and in the same 
province, an oversubscription of participants at the elementary level pro- 
duced an increase of three classrooms in that portion of the sample. 

Table 1 indicates the grade level frequencies participating in the study. As 
British Columbia does not have Grade 13 in its system, this grade is not 
represented there. In other respects, the obtained sample provides a fairly 
complete range of classes in the public systems of both provinces, but one 
that is obviously quite restricted. 


Characteristics of the Classrooms 


The elementary teachers who agreed to participate in the study were allowed 
to select one of the subjects they taught for data collection purposes. The 
secondary teachers were categorized by various departments to ensure a 
broad coverage of subject specialties. Table 2 indicates the subject matter 
that was ultimately obtained for the data collection exercise described below. 
The designation “integrated” accommodated those primary teachers who do 
not teach subjects discretely. Two of the teachers monitored individual stu- 
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Table 1 
Grade Levels in the Study, by Province 


Grade Kent 2 3 4 5 6 7 8 Oe 10a 812 
British Columbia . 1 1 2 1 1 3 2 0 4 2 0 4 4 
Ontario 1 1 1 1 1 0 2 0 0 3 3 4 2 
Total el. dP3 3 2 (2 3 4 0 4 5 3 8 6 


Note: Ontario also had 4 grade 13 classes and 1 special class. 


dents rather than individual subjects because of the way in which their 
curriculum and instruction was organized. The only imbalances in subject 
matter between the two provinces occur in elementary science where three 
teachers in British Columbia monitored science classes while none did in 
Ontario, and at the secondary level, the arts in Ontario were more heavily 
represented in the obtained sample than was the case in British Columbia. 


Instrumentation and Data Collection 


Once the teachers in the study had been identified by their principals or 
department heads, they were contacted by one of the two principal inves- 
tigators and informed of the purposes and procedures of the study. 

Each teacher was required to choose a reporting period for a particular 
subject and to collect all the evaluation instruments used with that class for 
that period. An instrument was defined as “any test, rating scale, project, 
checklist, or assignment used to assess student performance.” At the elemen- 
tary level, a reporting period was usually interpreted to mean the time 
between the issuance of report cards, whereas at the secondary level, it was 
usually interpreted to mean a semester or a term. Teachers were given the 
opportunity to define reporting period in a way that was congruent with their 
teaching. 


Table 2 
Subjects Represented by Province and by Level 


Level British Columbia Ontario Total 
Elementary 
English 2 2 4 
Maths 5 4 9 
Science 3 0 3 
Integrated 1 1 2 
Special Education 0 1 1 
Secondary 
Humanities 3 3 6 
Maths 2 fe 4 
Science 3 4 7 
Social Studies 5 4 9 
Arts & Physical Education 1 3 4 
Totals 25 24 49 


R.J. Wilson 


Two data collection instruments were given to the teachers for comple- 
tion. The first was a log on which the date, instrument description, number 
of students completing each instrument, number of A’s, number of F’s, 
average, and total marks were recorded. Definitions for these categories were 
stated on the log itself for reference. The number of A’s, for example, was 
defined as “the number of students receiving the highest grade designation” 
on the instrument. The number of F’s was defined as “the number of stu- 
dents who did not receive a passing grade.” The average could be given as 
either a mean or a median. 

The second data collection instrument was a checklist that was appended 
to each instrument. It required the teacher to indicate which of 10 common 
purposes this instrument was intended to serve, how well the students had 
performed on the instrument in the teacher’s judgment, what measurement 
characteristics the instrument possessed (written in nontechnical language), 
and where the instrument had originated. The checklist was completed or 
attempted for 776 instruments. 

At the conclusion of the data gathering period, each teacher was inter- 
viewed in a structured format. Interviews averaged about 45 minutes and 
were conducted one-on-one with either one of the two principal investigators 
or a research associate. Recordings of the interviews were not made but 
detailed notes were kept and subsequently transcribed. The purpose of the 
interview was to explore the policies and procedures that teachers followed in 
their evaluation activities. Questions dealing with informing their students 
of the expectations of the course or unit, the preparation and construction of 
the measures used, types of items favored, changes in evaluation over the 
time period, use of administered instruments by students and teachers, and 
sources of guidance for evaluation activities were included in the interview 
format. 


Data Entry and Analyses 


The data from the logs and checklists were quantified and analyzed using 
descriptive statistics, cross-tabulations, and correlations. The data from the 
interviews were transcribed from notes, returned to the participants for 
reliability and validity checks, and their amended responses incorporated 
into the recorded notes. Five of the 49 teachers participating returned a- 
mended copies of the interview summaries. 

Obviously, no legitimate attempt can be made from these data to make 
generalizations across classes, departments, schools, districts or boards, or 
provinces. Nevertheless, they may be suggestive of trends and approaches 
that could lend themselves to a more systematic sampling plan and a more 
precise definition of variables in the future. It is in that exploratory vein that 
the following report of the data and their interpretations is made. 


Results 


Characteristics of the Measurement 

Frequency. Table 3 displays the frequency with which the teachers report- 
ed administering instruments to their selected classes. Direct comparisons 
among teachers are clouded by the understanding that each teacher could 
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Table 3 
Mean Number of Instruments per Class, by Province, Level, and Grade 


Grade British Columbia Ontario 
Elementary 
Kindergarten 34 4 
1 12 7 
2 Zio 8 
3 10 14 
4 Ae, 17 
s) 9.3 
6 15 38 
Special 56 
Elementary Means 15.8 227), 
Secondary 
8 14.5 
9 duhess 19.3 
10 15.3 
11 13s 16 
12 10,75 15:35 
13 9.5 
Secondary Means 12.7 14.8 
Total Means 14.2 dive 


decide on the time period for reporting results. Using the concept of the 
reporting period, however, leaves the impression that students in both pro- 
vinces would be assessed between 14 and 17 times on average in one course 
for one reporting period. This is consistent with the findings of another study 
(Wilson, 1989) that documented the frequency of evaluations for a secondary 
school in Ontario for a single semester. If any trend is discernible in the data, 
it maybe that there is a tendency in British Columbia to be fairly consistent 
across grade levels in terms of the number of evaluations conducted, but that 
generally in the Ontario sample, there is a tendency to measure somewhat 
more frequently than is the case in British Columbia. 

Another aspect of evaluation frequency is its regularity. The most com- 
mon pattern of evaluation indicated a fairly equal spacing of the evaluations 
over the course of the reporting period. Some logs indicated that the evalua- 
tions became more heavily weighted over the term, but many did not. Some 
teachers reported increasing the number of evaluations near the end of the 
reporting period as report card time approached (at the elementary level in 
both provinces). 

At the secondary level, the suggestion that the subject matter might lend 
itself to a simple-to-complex rather than a regular pattern of assessment was 
examined. Such a pattern did not emerge from the interview and log data. 
Whereas some social studies teachers, for example, did favor the simple-to- 
complex model, others evaluated on as regular a basis as did their colleagues 
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Table 4 
Origin of Instruments 
(Column Percentages Given in Parentheses) 


British Columbia Ontario Total 
Elem Sec Elem Sec 

Borrowed From 93 46 67 oo 239 
Other Sources (59) (26) (37) (16) (33) 
Adapted From 10 26 35 57 128 
Other Sources (6) (15) (20) (27) (18) 
From Own Files 5 6 20 18 49 
(3) (3) (11) (9) (7) 

Created It Myself 49 98 57, 101 305 
(31) (56) (32) (48) (42) 

Total {eye 176 179 209 721 


in mathematics and biology, some of whom used the simple-to-complex 
scheme or some combination of both approaches. 

Origination. Both provinces have provided evaluation resources for edu- 
cators to use in their classroom assessments if they so choose. For 721 of their 
instruments teachers indicated where these had originated. (See Table 4 for 
a level-by-province breakdown of the sources of instruments.) 

More of the secondary instruments than the elementary instruments 
were created wholly by the teachers. Fifty-two percent of the secondary 
instruments were reported as self-created while 32% of the elementary 
instruments were so described. Most of the rest of the instruments were 
either borrowed or adapted from other sources. In interviews with the teach- 
ers it was determined that these other sources might be published tests or 
item pools but were more often workbooks, texts, guidelines, and in British 
Columbia previously administered government examinations. The major 
complaint about outside evaluation tools was that they usually needed to be 
altered to fit the expectations of the particular class or course in which they 
might be used. 

A total of 7% of the instruments, more in the Ontario sample than in the 
British Columbia sample, was reported to be located in test files. The rather 
insignificant use of previously used test materials may account for the almost 
total lack of response to an interview question concerning item tryouts, 
validation of keys, and item analysis generally. While one or two teachers 
reported using machine scoring for some tests, a scoring which produced 
difficulty level information for items, most had no experience with test 
statistics other than medians or means nor with the concepts of test or item 
analysis. 

Item types. It was possible to categorize the different type of items used on 
the instruments in three ways. Certain items required the student to select 
information from choices given (alternative response, multiple choice, and 
matching), while others required the student to provide an answer (comple- 
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tion, short answer, essay, and construction). A third type required the stu- 
dent to perform a task, and that performance would be rated subjectively 
(anecdotal report) or objectively (checklist or rating scale) by the teacher. 

Teachers typically reported using three or four of these item types in their 
evaluation, and defended their choices of types in a variety of ways. Some 
indicated that the age of the student meant that certain ways of evaluating 
were more appropriate. In both provinces, there was a general trend toward 
using performance measures at the primary level, while moving toward 
completion and, to some degree, multiple choice at the junior/intermediate 
grades. 

At the secondary level in both provinces, the content of the subject was 
often given as a reason for the use of particular item types. Essay answers, 
for example, were cited as fitting with the objectives of particular courses 
better than other item types. But the pragmatics of evaluation also featured 
quite prominently in secondary teachers’ reasons for item use. Conditions 
such as appearance on government examinations, need to diversify ways of 
collecting information, and ease of construction and marking were also 
frequently cited as reasons for use of particular item types. 

The number of instruments reporting at least one example of each item 
type is recorded in Table 5. While much of these data could be shaped by 
particular teachers using many evaluations, there is a clear indication that 
short answers requiring single word or sentence responses characterize 
much of the evaluation at both levels. Objective performance items (check- 
lists and rating scales) were also commonly used, especially in Ontario. The 
anomaly of the use of matching items at the secondary level in British 
Columbia can be explained by its appearance on some government examina- 
tions. It is the most common selection type of item in widespread use among 
the teachers in this sample although multiple choice was the third-ranked 
item in use by elementary teachers in Ontario. 


Characteristics of the Judgments 


For each instrument included, teachers were asked to rate the obtained 
achievement of their students compared to what the teachers’ expected them 
to achieve. A consistent pattern emerged across levels and provinces con- 
cerning this aspect of judgment as shown in Table 6. More than half the time, 
the students performed about as well as teachers expected. For the total 
group about one fifth of the results were better than expected. This finding 
was reinforced by spontaneous comments by teachers about the relationships 
they expect between the measures they create and their judgments about 
achievement. Typical comments were as follows: 

An elementary teacher in British Columbia, “Things do not change too much. 
A C+ in September is usually a C+ in June.” 

A secondary teacher in British Columbia, “Basically [you get] the same 
results if you give two tests or a thousand, it ranks them the same way.” 

An elementary teacher in Ontario, “Each year the tests are tailored to the 
class.” 
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Table 5 
Number of Instruments Using Particular Item Types by Level and by Province 
(Percentages of Total Given Within Columns) 


Item Type British Columbia Ontario Total 
Elem Sec Elem Sec 
Selection 
Multiple 5 19 38 19 81 
Choice (3) (7) (18) (6) (8) 
Matching 0 34 ; 5 8 47 
(0) (13) (2) (2) (5) 
Alternative 6 14 9 tA 36 
Response (3) (5) (4) (2) (4) 
Supply 
Completion 87 81 76 59 303 
(47) (30) (36) (18) (30) 
Short Answer 8 61 10 60 139 
(4) (23) (5) (18) (14) 
Essay 8 16 9 50 83 
(4) (6) ra (4) (15) (8) 
Construction 9 18 18 45 90 
(5) (7) (8) (14) (9) 
Performance 
Objective 22 26 46 65 159 
Performance (12) (10) (21) (20) (16) 
Subjective 42 2 3 14 61 
Performance (22) (1) (1) (4) (6) 
Total 187 27| 214 327 999 
Table 6 


Teachers’ Expectations of Student Performance 
(Column Percentages Given in Parentheses) 


British Columbia Ontario Total 
EXPECTATIONS Elem Sec Elem Sec 
Better than expected 27 46 59 38 170 
(15) (26) (32) (16) (22) 
About as well as expected 79 96 104 137 416 
(45) (54) (57) (58) (54) 
Poorer than expected 11 26 5 33 75 
(6) (15) (3) (14) (10) 
Other or omit 60 10 16 29 115 
(34) (6) (9) (12) (15) 
Total AZ 178 184 PH / 776 
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A secondary teacher in Ontario, “When you’ve been teaching that long, 
you've got a pretty good handle on it. By the time I get the names down, I 
have a pretty good idea of where they are.” 

Such expectations indicate that the teachers predict a pattern of results 
based on their previous experiences with the students forming a particular 
class. However, they contradicted the analysis in Table 6 when, in describing 
their instruments on the checklist, they only checked 20% of them as con- 
firming an informal assessment. 


Characteristics of the Decision Making 


Each teacher was asked to rate the purposes of each instrument with no 
restriction on the number of purposes each could serve. The results are given 
in Table 7. The overwhelming purpose checked for most instruments was the 
generation of marks for reporting purposes (81%). At the elementary level in 
both provinces, the need to diagnose students’ weaknesses with material was 
also a frequently cited purpose. At the secondary level in both provinces, the 
checking of students’ progress against course objectives was the second most 
frequently checked purpose. 

Certain purposes can be deemed important to the teaching-learning pro- 
cess while others are important for external purposes. At the elementary 


Table 7 
Percentage of Instruments Citing Purposes of Evaluation 
(Rank Within Column Given in Parentheses) 


British Columbia Ontario Total 
PURPOSES Elem Sec Elem Sec 
Marks 86 (1) 80 (1) 77 (3) 81 (1) 81 
Criterion-Reference 44 (4) 70 (2) 71 (4) 78 (2) 67 
Apply learning 48 (3) 60 (4) 82 (2) 67 (3) 65 
Diagnosis 56 (2) 47 (8) 90 (1) 37 (8) 56 
Self-Monitoring 28 (8) 5315) 62 (5) 41 (7) 46 
Prepare for future exams 28 (9) 62 (3) 29 (8) 48 (5) 42 
Assignment complete 26 (10) 50 (6) 29 (9) 49 (4) 39 
Self-Teaching slot ¢a) 47 (7) 43 (6) 33 (9) 39 
Norm-Reference Pls Ali i) 30 (9) 25(11) 42 (6) 31 
Teach next 40 (5) 23 (10) creda 5 (11) 25 
Other 37 (6) 5.(11) 28(10) 13 (10) 10 


Note: Marks: to generate marks for reporting; Criterion-Reference: to check students’ progress 
against course objectives; Apply learning: to have students practice or apply what has been 
learned; Diagnosis: to diagnose students’ weaknesses with the material; Se/f-Monitoring: to 
enable students to monitor their own progress; Prepare for future exams: to prepare students 
for this kind of evaluation in the future; Assignment complete: to ensure that students do 
assigned work; Self-Teaching: to allow me to see how well | taught the material; 
Norm-Reference: to compare students’ achievement to others; Teach next: to help me decide 
what to teach next. 
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level in both provinces, the five top-rated purposes were concerned with 
informing the teaching-learning process except for the generation of marks 
for reporting purposes. At the secondary level in both provinces, a concern 
with preparing for evaluation in the future along with mark generation were 
important aspects of purpose. For the elementary teachers generally, the 
lowest rated purposes were those external to the teaching-learning process: 
to compare students’ achievement to others, to ensure students do assigned 
work, and to prepare for future evaluations. The lowest ranked purposes for 
secondary teachers in both provinces were concerned with diagnosis and 
reteaching: deciding what to teach next, diagnosing students’ weaknesses 
with the material, and allowing teachers to see how well the material had 
been taught. 

During the interview, teachers were asked to rate their evaluation in 
terms of criterion-referenced and norm-referenced purposes, both in terms of 
their own perceptions and what they thought their students would perceive. 
In the primary classrooms, the issue of norm-referencing does not generally 
arise. As the grade level in the sample increased, however, so did the report 
that the students would perceive the evaluations in a normative or in some 
combination of normative and criterion ways. At the secondary levels in both 
provinces, the teachers were inclined to see the instruments they developed 
as objectives-based, a view that is supported by the frequency with which the 
checking of the students’ progress against course objectives purpose was 
noted. These teachers recognized, however, that in some respects their stu- 
dents were right in seeing their results in a comparative way, at least in some 
limited respects. 

A final decision making issue concerned itself with what teachers and 
students were expected to do with the results of the evaluation process. 
Almost all the teachers in the sample reported a process of “taking up” or 
“going over” the completed instruments in class. At the elementary level, this 
process was frequently supplemented by encouraging the students to take 
the corrected instruments home. At the secondary level, students were more 
often encouraged to file them for use in final examination preparation. 
Several teachers at this level reported collecting the instruments so that they 
could be adapted for successive classes, and six secondary teachers reported 
that they had a retesting policy for students who had done poorly. 

As for the teachers’ own use of the results, the elementary sample often 
gave specific alterations in methods and content they would use the next 
time they taught that particular unit although the frequency of this type of 
comment decreased as the grade level increased. At the secondary level, 
particularly in British Columbia, there was little to no change expected in 
methods and content, although a few teachers mentioned that their expecta- 
tions for students might be altered. In Ontario at the secondary level, the 
teachers were more likely to make general comments about the pacing of the 
instruction, for example, than was the case in British Columbia, but still the 
overall sense was that the content and method were constant and stable. 
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Discussion 


The teachers’ concerns with measurement characteristics of the instruments 
they use focus on content validity and on centering the items to the students’ 
abilities. Elements that shape much professional evaluation practice—test 
blueprinting, item analysis, statistical manipulation of data—are virtually 
absent from classroom practice. Indeed, the search for objective measure- 
ment is deliberately offset by the frequently-expressed desire to match the 
measurement results to expected results. Items and tasks tend to be required 
that students are likely to be successful in, that look like tasks that were done 
in the regular flow of classroom work, and that support conclusions about 
individual and class achievement already made on other bases, including the 
results of other instruments. Such an approach inhibits the use of instru- 
ments prepared by others unless these can be adapted to specific groups of 
students. 

The judgment that flows as a result of this activity, then, is largely one of 
confirmation. With more objective measurements, discrepancy analysis could 
be used. But when the object of the measurement activity is to produce 
information that is congruent with what was previously known, then the 
possibility of discrepancies being noted is limited. The cases of teachers 
preparing students for government examinations are enlightening in this 
regard. 

Where the provincial examination mark is to be combined with classroom- 
generated marks, the expectation might plausibly be made that the class- 
room activities would concentrate on those objectives and activities that 
cannot reasonably be evaluated in paper-and-pen fashion. What actually 
happens in our sample of teachers is a tendency to imitate the provincial 
examination in virtually every instrument. Not only does this give the stu- 
dents practice in the types of questions featured in such examinations, but it 
allows the teachers’ judgments of the students to correlate highly with those 
emanating from the single, end-of-year assessment. 

A notable exception to this general, confirmatory pattern occurs at the 
primary and early primary levels. Here, the teachers were more likely to 
stress as major purposes of evaluation those that dealt with the teaching- 
learning process. They were also more likely than their later elementary or 
secondary colleagues to report using the information generated by the instru- 
ments to alter their teaching activities, particularly for individual students. 

It is also the case that this particular group of teachers operates in a policy 
and procedural framework that is quite different from that of their col- 
leagues. Report cards are more likely to be anecdotal and personal and less 
numerical and comparative. Outside examinations are infrequent or nonex- 
istent. Preparing for future schooling or work where school marks may be 
used as criteria is not relevant. Consequently, the purposes these teachers 
had for evaluation rated those appropriate for internal classroom purposes 
much more highly than those purposes were rated by secondary teachers. 

Where the outcomes of the assessments serve primarily to confirm judg- 
ments already made about relative achievement, there will be little that 
teachers can do to use the information for either self-improvement or in- 
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dividual student diagnosis. That this confirming purpose overrides these 
other potential uses is verified by the use made of the instruments’ results. 
They tend to be analyzed in terms of the rationale for the judgment, and then 
stored, to be used later as contributor to single, numerical summaries of 
individual achievement. 

Decision making, in the sense of making choices among courses of action, 
is not a plausible expectation when the dominant aim of measurement is to 
confirm. The rational decision making model depends on the possibility of 
divergent outcomes, with each outcome associated with a different course of 
action. The most sensible course of action to proceed with in the case of 
confirming information is to carry on as usual—which is precisely what most 
of the teachers in the study reported doing. 


Conclusion 


The evaluation of student achievement by teachers in classrooms occurs 
within a policy and procedural framework that is largely determined by 
outsiders. As other parts of our study are in the process of demonstrating, the 
department and school, the district or board, and the provincial authorities 
all weigh in with regulations and expectations that color much of the data- 
gathering activities teachers pursue. 

Where that does not occur so much, at the primary levels, the teachers 
tend to use their measurement and evaluation devices to comment on the 
growth of individuals and the success with which a particular unit or activity 
worked. Beyond those grade levels, however, the procedures for reporting 
especially mean that most teachers proceed with their evaluations primarily 
because they must produce marks, and these marks must be in a prescribed, 
usually norm-referenced, form. At the tertiary end of public schooling, at 
least in British Columbia, there is a further check on the congruence of the 
teachers’ assessments in certain subjects through a formal examination 
program. One certain effect of this program is to bring the teacher-generated 
marks in line with the examination outcomes. 

From early on in the schooling process, the formative purposes of evalua- 
tion begin to give way to summative ones, from internal classroom uses to 
external, public ones. One deleterious effect of this movement—if the teach- 
ers involved in this study are representative of their colleagues and their 
systems—is to encourage instructionally irrelevant evaluation, of replacing 
what might enhance the teaching-learning process with a mark-driven ritual. 
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Administrative Response to 
Classroom Testing Data: 
A Problem Solving Perspective 


A group of 15 practising high school principals were presented with a case study to examine 
their responses to classroom testing and the problems associated with tt. The case contained 
information on a district-wide report outlining inconsistencies in teachers’ evaluation 
practices, with a parental complaint concerning the evaluation practices of a staff member. 
Response protocols were categorized two different ways: first, into global areas of concern— 
student, teacher, parental complaint, district report, school department, school, district, and 
problem solving method; and second, by type—reasoning, action, reiteration, values, back- 
ground, summary, and comment. There was little evidence that the statement types and 
patterns of explanation bore a relationship to the-quality of actions individuals might take. 
The study may have been tapping differences in think-aloud ability rather than administra- 
tive ability. However, only about half the subjects related the district-wide study to the 
parental complaint. The study also examined the methodology used. Although the analysis 
began from a general skills perspective, context-specific elements had to be added to 
complete the task. Implications for the analysis of problem solving data are explored. 


Introduction 


The intent of this report is to set the issue of classroom testing and problems 
associated with it in the broader context of the school program and its 
ongoing maintenance. To accomplish this end, we presented practicing high 
school principals with a case study and examined their responses to the 
situation presented. The case contained information on a district-wide report 
outlining inconsistencies in teachers’ evaluation practices, as well as an 
immediate parent complaint about the evaluation practices of a staff mem- 
ber. 

In addition to dealing with the substance of the principals’ reactions, we 
also provide some detail on our methods, from cognitive science, of examin- 
ing the transcripts provided by our sample. Apart from questions involving 
this specific data set, we hope that our methodological comments are rele- 
vant to a broader issue in measurement, that of the systematic analysis of 
complex student achievement data. In particular, and very much in keeping 
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with a recent article by Perkins and Salomon (1989), we began our analysis 
from a general skills perspective and found ourselves needing to add context- 
specific elements to complete our task. The reasons for these modifications 
shed light on the analysis of problem solving data, whether the subjects be 
students, teachers, or principals. 


Background 


The rationale for this line of inquiry rests on three points: first, inconsisten- 
cies in classroom testing practice are a serious problem; second, the principal 
is a key figure in contributing to the solution of this problem; and third, our 
analysis methods, from the literature on problem solving, are appropriate for 
the task we have set ourselves. 


Rationale 


Testing Practice 
Measurement specialists have begun to make teachers’ assessment practices 
a target for examination. In an example of this line of inquiry, Traub, Nagy, 
MacRury, and Klaiman (1988) examined the assessment practices of high 
school calculus teachers, all from large, relatively wealthy school districts in 
the urban core of Southern Ontario. The teachers kept daily logs of their 
teaching and evaluation activities and provided copies of their tests, quizzes, 
and exams for an entire semester. In addition, they all marked the same set 
of student exams obtained from a school outside the study and provided by 
the research team. The results of the investigation showed considerable 
variation in teacher practice. Teachers differed substantially in use of class- 
room time, in the amount of student effort required, in relative emphasis 
given topics, in match between content taught and content tested, in meth- 
ods used for assessing student achievement, and in overall generosity in 
awarding marks. 

As well, Traub et al. (1988) have provided a convenient, although brief, 
review of related work. They report: 


1. Teacher-developed tests and classroom observations are the most com- 
mon methods of assessing students (Herman & Dorr-Bremme, 1983); 

2. Variations in assessment practices exist across subject areas and grades 
(Green & Stager, 1986; Terwilliger, 1987). For example, elementary 
teachers tend to favor observations, and secondary teachers testing 
(Stiggins & Bridgeford, 1985); and mathematics is heavily test-oriented 
compared with other subject areas (Wahlstrom & Danley, 1976); 

3. Teachers need training in teaching and assessing higher-order thinking 
skills (Haertel, 1986; Stiggins, 1988); and 

4. The measurement community needs to learn more about how assess- 
ment is conducted in the classroom by using more focused research 
methods such as classroom observation (Stiggins & Bridgeford, 1985; 
Stiggins, Conklin, & Bridgeford, 1986; Anderson, 1987). 

The Traub et al. (1988) results were incorporated in a case study pre- 

sented to a sample of principals in this study. The case study (Appendix A) 

contains a summary of the results obtained by Traub et al., modified only by 
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changing the number of schools involved from 17 to 15 and by placing these 
schools in one school district rather than 13. 


The Role of the Principal 


Current administrative theory assigns the task of curriculum leadership and 
management, including student evaluation practices, to the principal. 
Rutherford (1985) has outlined concrete aspects of effective school leaders. 
Among his indicators of effectiveness are the setting of clear expectations for 
both teachers and students, knowledge of individual classrooms, instruction- 
al support, and monitoring of progress. Leithwood and Montgomery (1986) 
identify four levels of principal effectiveness which focus, from lowest to 
highest, on the rules, the climate, the program, and the students. Among the 
factors under the principal’s rather than the classroom teacher’s control, 
they include assessment, recording, and reporting procedures. At the policy 
level, the expectation of the Ontario Ministry of Education (1988), stated in 
their recent manual on curriculum management, is that the principal will 
assume responsibility for achieving the goals of the curriculum. From both 
the theoretical and policy perspectives, a major role of the principal involves 
monitoring of the degree to which the objectives of the curriculum have been 
accomplished, that is, student evaluation and assisting teachers in the im- 
provement of their practice. 


Problem Solving 


Our strategy for analyzing the data collected in this study is rooted in the 
artificial intelligence approach to how people solve problems. Problems can 
be placed on a continuum from well-structured (logic puzzles, chess, and 
physics problems) to ill-structured (economic, social, and political problems). 
Well-structured problems are characterized by low linguistic load, little reli- 
ance on context, and widely accepted procedures and correct answers. III- 
structured problems are characterized by high linguistic load, much reliance 
on context, and debate about accepted procedures and solutions. Our focus, 
principals’ responses to case studies, is an example of an ill-structured prob- 
lem. 

The most immediate source for our analytic methods is the work of Voss 
and his colleagues at The Learning Research and Development Center, 
University of Pittsburgh (Penner & Voss, 1983; Voss, Greene, Post, & Pen- 
ner, 1983). They analyzed responses to protocols on the question of (ill-struc- 
tured) Soviet agricultural production problems, using parallel structures for 
problem solving and verbal reasoning. Examples of steps (“operators”) in the 
problem solving structure include “state subproblem,” “provide support,” 
and “summarize.” In the verbal reasoning structure, they categorized sub- 
jects’ responses as, for example, “state assertion,” “elaborate,” and “state 
qualifier.” 

Earlier work (Nagy & Allison, 1988), involving responses from a small 
sample of experienced principals to a different case study, provided an initial 
adaptation and refinement of the Voss et al. (1983) methods. In that analysis, 
Nagy and Allison first categorized statements from protocols into a set of 19 
categories (e.g., reasoning, subproblem, summary). They then drew diagrams 
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in which statements were grouped by subproblem; a system of indentations 
showed subordinate relations between statements. For example, a reasoning 
statement was indented once from the action it justified, and a subsequent 
elaboration of that reasoning was indented a second time. Finally, they drew 
highly simplified summary diagrams showing the pattern of subproblems 
identified and actions taken. 

In the present analysis, we began with essentially the same system. We 
found, however, that the detailed statement categorization caused us to dwell 
more on the language of the respondent than on the substance of his or her 
response to the case and that our category system failed to provide a basis for 
judgments of quality (e.g., for differentiating poor problem solving from 
better). Thus we greatly simplified the statement categorization system and 
paid more attention to the subproblems addressed by the principals and the 
specific actions that would have been taken. In retrospect, we question the 
extent to which methods involving categorization of statement types tap 
verbalization ability compared with problem solving ability. On this issue, we 
have some suggestions for refinement of the methodology. 


Method 


Sample 

The principals who participated in this study were chosen for convenience— 
essentially from personal knowledge and for geographical location. As about 
half came to our attention through supervisor or peer recommendation, the 
sample should probably be considered slightly more skilled than a random 
sample of experienced, practicing principals. However, apart from knowing 
that three of the 15 are in their first or second year at the principalship, we 
have no systematic way of distinguishing relative expertise. All interviews 
were conducted in the fall of 1988, in the offices of the principals, and at their 
convenience. 


The Case Study 


The hypothetical case study (Appendix A) contains two elements. The first 
element involves results of a research study recently completed in the high 
schools of the district. The study, using one calculus teacher from each 
school, showed large differences (a) in use of class time (e.g., seatwork versus 
direct teaching), (b) in amount of practice given, (c) in content covered, (d) in 
amount of evidence collected for grade assignment, (e) in match between 
content taught and content tested, and (e) in grades assigned to a common 
set of student exams. The study was reported to the board, where it oc- 
casioned some comment. The principal in the situation had previously asked 
his math head and department to discuss it, and they reported no problems 
in their school. The second element of the case study is a parental complaint. 
The student in question, reported by her father as having done well in math 
in the past, is depicted as having difficulties in calculus. The father wants a 
change from one teacher to another. In the scenario, the responding principal 
is asked to place himself or herself in the shoes of the principal in the case 
study and describe what he or she would do in the situation. The interviewer 
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answered clarifying questions, but no probes were used to explore areas that 
did not spontaneously occur to the responding principal. 

In order to understand the context, the reader might find it helpful to 
know that calculus is taught in Ontario high schools only in the final, the 
Ontario Academic Credit (OAC) year. For most students, this is the fifth year 
of high school. However, the system is designed so that the more able, if they 
are willing to take a “no-frills” curriculum, can finish high school with 
university entrance requirements complete in four years. For further con- 
text, there are no provincially set exams nor any equivalent of the Scholastic 
Aptitude Test in Ontario. High school grades are the major basis for univer- 
sity entrance, and they are determined entirely by the teacher. 


The Analysis 


In the initial analysis, response protocols were segmented into thought units 
and numbered sequentially. A “thought unit,” intuitively defined, was essen- 
tially a sentence; compound sentences were split, and items listed in a 
sentence were individually numbered. Next, these units were categorized 
into about 20 categories using a system adapted from Nagy and Allison 
(1988). Finally, diagrams were drawn to show the flow of the problem 
solution. In these diagrams, flowchart-like symbols for statement types were 
used, sequence was shown by vertical (downward) shifts, and change in 
subproblem area by left-to-right shifts. 

As described above, this approach was judged unsatisfactory because it 
dealt too much with the articulation of the problem solving and not enough 
with the substance. We felt that we were dissecting differences in oral 
communication ability rather than problem solving ability. More important- 
ly, it did not provide a systematic method of describing what each individual 
chose to do or not do in the presented situation. It left the distinction between 
conscious linking of subproblems and uncritical meandering between sub- 
problems to the unaided subjective judgement of the analyst. 

The revised analysis proceeded from two perspectives: that of the problem 
solving processes used, and that of the issues addressed. Two major changes 
from the first attempt were undertaken. First, the preliminary analysis had 
given us a sense of the range of areas addressed and actions taken. We made 
a decision to assign each statement uniquely to one of eight areas. (This 
required some resegmentation into much smaller thought units.) Second, 
the number of statement types was greatly reduced, from about 20 to just 
eight. Mainly, a large variety of subtle distinctions were ignored in favor of 
one larger category, “reasoning.” 

The analysis reported has four components: first, the emphasis given to 
each area; second, what was actually accomplished by each respondent; third, 
the statement types used; and fourth, the explicit value statements made by 
the respondents. Where possible, we attempt to relate these different views 
of the data to each other. 


Results 


The responses were segmented into statements. With one low outlier (Prin- 
cipal #1, 23 segments) and two high outliers (#4, 461 segments, and #6, 259 
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segments), the responses clustered quite closely around 100 segments. 

Preliminary analysis showed that the respondents dealt with eight different 

areas: 

1. Student—gathering information on the student, talking with him or 
her, or setting up assistance. 

2. Teacher—talking to the teacher, or getting information on the teacher’s 
performance (also includes any dealing with the receiving teacher). 

3. Parent/Complaint—talking to the parent, trying to resolve the issue; ex- 
cludes any information gathering needed for resolution of the conflict. 

4, Study—discussion of the study of assessment practices in the abstract; 
that is, not in the context of what the department, the school, or the dis- 
trict ought to do. 

5. Department—talking to the four teachers of calculus as a group, rather 
than the teacher whose student has complained; or talking to the depart- 
ment head; discussion of procedures the department ought to have in 
place, or information that ought to be available. 

6. School—discussion of school-wide issues related to the case; response 
had to involve explicitly other departments beyond the math depart- 
ment, or clearly talk of “departments” or “heads” in the plural. 

7. District—discussion of issues involving the elected board, the district 
central administration, or district level committees. 

8. Methods—any discussion of how to solve problems, or how to deal with 
conflict. 


GLOBAL AREA 


Student Teacher Par./Comp. Study Department School District Method 


ine) 
i aa 
Peis) 


oa 
Be == 
— = ia 


co 
== 
= 5 


ian =) a 0 
=F™ 
=f 


Figure 1; Percentage statements by global area 
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Figure 1 shows the degree of emphasis in each principal’s protocol on the 
eight areas. The sum of the widths of the boxes in Figure 1 for each principal 
is a constant, representing 100% of the statements given in response to the 
case. As can be seen, while all respondents paid some attention to the parent 
making the complaint, only two thirds of them talked directly to the student 
or the teacher. Only one third dealt with issues beyond the school boundaries, 
and of these (not shown in the figure), only one made a reference beyond the 
school district (i.e., to the effect that students had to compete with students 
from other schools for admission to university). Just over one half of the 
respondents discussed issues raised in the research study apart from the 
specific parental complaint, while only two dealt in a meta-analytic fashion 
with problems or conflict. All but one of the respondents discussed depart- 
mental procedures. A few of the respondents (#3, #14, #15) can be seen to 
have addressed only a limited number of the eight areas, but most dealt with 
five of the eight. 

Table 1 gives a matrix of what was accomplished or addressed by each 
principal. These accomplishments do not have a one-to-one correspondence 
with the actions taken; several actions might be undertaken for one ac- 
complishment, for example, to find information on the student. As well, there 
are only seven categories in Table 1, compared with eight in Figure 1; no 
aspect of the discussion of the study could be considered an accomplishment 
apart from accomplishments involving the teacher, the department, or the 
school. An examination of Table 1 shows that only six of the 15 responding 
principals would have checked to see if the information from the parent on 
the student was accurate. However, eight of 15 set up some type of assis- 
tance, either academic or personal, for him or her. Except for two respon- 
dents, there was generally less investigation of the individual teacher than of 
the student. Only about one half the sample tried to find out what the 
department did to monitor or control teacher practices with respect to the 
topics examined in the district study. In effect, such investigation shows a 
clear linkage between the individual complaint and the general issue. Thus 
one half the sample treated the study and the complaint as only marginally 
related. 

The information in Table 1 sheds light on how the principals felt about the 
issues raised in the reported study of calculus teacher practices. Three re- 
spondents talked of dealing with the specific complaint within the school. 
Two of these individuals were concerned to avoid the imposition of a solution 
(to the general issue rather than the complaint) from outside the school, 
while the third talked of the political ramifications of having a hostile parent 
go to the district administration. Only three respondents stated that the 
district office needed to be involved in resolving between-teacher inconsisten- 
cies. 

To further explore how respondents felt about the issues, we isolated their 
value statements (see Appendix B). About half stated that the promotion of 
uniformity of marks within the school was important, but only two spoke 
specifically in favor of common exams, even within the school. Most talked of 
ongoing communication among teachers as the main vehicle for the promo- 
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tion of uniform marking standards. Also, in only a couple of cases is it explicit 
that the motivation for achieving consistency is fairness for students rather 
than the avoidance of complaints. 

As a final step in our analysis, we categorized statements according to 
type: 

Reasoning—a general term including several categories of the original anal- 
ysis: identifying subproblems, pointing out assumptions, citing information 
from the case, identifying missing information, predicting, explaining an ac- 
tion, analyzing the case; Details of Reasoning statements are tabulated 
separately. 

Action—specific statements of what the respondent would do; again, Details 
of Action statements are tabulated separately. 

Reiteration—simple repetition, which was distinguished from attempts to 
provide a summary. 

Values—statements of what the respondent considers important about as- 
sessment, students, or education; where detail or elaboration was provided, it 
was not tabulated separately. 

Background—relating the case to the respondent’s personal experience; in 
two situations, a respondent either began or ended the discussion with a 
lengthy personal anecdote; these were simply not analyzed. 
Summary—summaries of previous steps. 

Comment—statements about the process itself, including both throwaways 
(“Where was I?”) and substantial meta-analytic statements (“I like to lay out 
the steps in a problem solving process before I begin.”). 

Figure 2 gives a breakdown of the statement types used by each respon- 
dent. Again, the sum of the width of boxes for each respondent is a constant, 
representing 100% of the statements. For Reasoning and Action, the bar 
down each box distinguishes Reasoning or Action, on the left, from Detail, on 
the right. In the two cases with no bar, there was no detail provided. One 
interesting comparison to make in Figure 2 is between Reasoning and Ac- 
tion. For about one half the sample, there were more than twice as many 
Reasoning statements as Action statements. 

There are several other patterns within Figure 2, but their relationship to 
the quality or nature of response to the case is not clear. Voss and his 
colleagues (Penner & Voss, 1983; Voss et al., 1983) suggest in their research 
that at least two factors differentiate the expert from the novice: experts 
isolate more abstract or general problems as being the primary factors 
producing a given problem, and experts show more skill in providing suppor- 
tive reasoning for their stated solutions, in stating subproblems which may 
emerge from the solutions, and in evaluating solutions in relation to con- 
straints. Neither of these patterns is apparent here, however. 

Subproblem identification in our protocols was frequently implicit rather 
than explicit. None of the respondents consistently articulated or defined 
subproblems associated with the case; rather, subproblems were implicitly 
identified in the stated solutions or action plans. Hence a category for sub- 
problem identification has not been included as one of the seven statement 
types. Instead our analysis focused on the matrix of activities and the areas 
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Figure 2: Percentage statements by type 


commented on by the respondents. We found that everyone dealt with the 
immediate problem of responding to the parent, but only half dealt with the 
broader or more abstract issues concerning the study or the ramifications for 
the school, and only one third mentioned the school district. However, of the 
three novice principals (those with less than three years experience), one 
(#2) talked of all three areas, and another (#4) talked of two of them. 

In terms of reasoned arguments, some respondents reiterated and back- 
tracked more than others; some provided periodic summaries; others related 
their actions strongly to their own school situation; and some expressed 
many more values than others. Again, however, there was no correlation 
between the extent of these activities and the experience of the principal. In 
fact, a novice (#4) was one of those who employed substantial supportive 
reasoning. Moreover, in our subjective opinion, more reasoning is not nec- 
essarily better; some respondents engaged in critical analysis of the problem 
and others in uncritical meandering. The present analysis by statement types 
does not differentiate between these two. 


Discussion 


Several issues, both methodological and substantive, arise from this inves- 
tigation. With a reminder that we believe the analysis of complex problem 
solving data should be of interest to the measurement community, we begin 
with comments on our own methodology. 

First, there is little evidence in the present data set to suggest that 
statement types used and patterns of explanation bear a relationship to the 
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quality of action the individuals might take when placed in a situation similar 
to the presented case. We are at this point unable to relate differences in 
ability to articulate one’s views to ability to run a school successfully. Similar- 
ly, there is little reason to believe that patterns of statement use, such as in 
Figure 2, can be related to substantive issues involving student assessment 
attitudes and practice. One explanation, which we are currently pursuing, is 
that we have a restricted-range problem. Research on expert-novice differen- 
ces (e.g., Patel & Groen, 1986) has shown differences in expert use of 
“forward reasoning” and novice use of “backward reasoning.” By such a 
criterion, all our sample were experts. We may indeed find the data on 
statement type useful in differentiating between responses from practicing 
principals and true novices such as student teachers, or less experienced 
classroom teachers. 

In addition, we felt the need to introduce Table 1, which amounts to a 
subject specific “marking scheme,” the subject being school administration. 
This is in keeping with Glaser’s (1984) argument for the importance of 
specific knowledge over general skills. Using a content-based system, we can 
distinguish levels of success at the task. However, as Perkins and Salomon 
(1989) point out, more atypical problems might better allow differentiation 
between experts and novices in general skill (i.e., the approach in Figure 2) 
rather than subject-specific skill (i.e., the approach in Table 1). Perhaps our 
case study was not novel enough for our sample. At the time, we found 
comments such as “this is a real problem—it happened last week” quite 
gratifying, showing that we had not lost touch with school realities. However, 
a truly novel problem may have allowed differentiation in terms of general 
skills as well as subject-specific accomplishments. 

Third, we felt discomfort that we were tapping differences in verbaliza- 
tion skills rather than school administration skills. This suggests develop- 
ment of a substantial set of probes to compensate for verbalization ability. 
With such probes, we would still be able to distinguish what is spontaneously 
offered by respondents, but we would be less in the dark about all the things 
not said. That is, we would know if a given respondent had actively chosen 
not to take some action or had merely forgotten to mention it in the discus- 
sion. As Nagy and Allison (1988) pointed out, our experts are experts at doing 
things, rather than, as were Voss’ experts, at talking about things. However, 
this argument cannot be carried too far; one might reasonably argue that 
principals earn their living with words as much as do academic agricultural 
experts. 

On the substantive side, we found reasonably consistent desires among 
our sample for grading consistency, but large differences in how large a role 
to play in achieving this consistency. Only a minority took an active hand in 
departmental discussions; most left it to the department chair. This view is 
not in keeping with the view of the principal as curriculum leader. However, 
it does seem to represent current practice in Ontario high schools. 

Second, and this may be a methodological point poorly disguised, we have 
tapped the “craft” of administration. Our subjective impression is that our 
group of respondents, all successful by different criteria, actually differ less 
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in administrative ability than our results show. This suggests that we need to 
explore the importance of rationality—the requirement for success at our 
task—to actually running a school. 

Finally, and perhaps most important for classroom assessment, only a- 
bout one half our sample actually treated the study described in the case 
study as relevant to the parent complaint. This is perhaps the most impor- 
tant food for thought for those involved in the improvement of teachers’ 
assessment skills. 
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Appendix A 
Student Evaluation Case Study 


Background 


Smithson Collegiate Institute (SCI) serves a predominantly affluent, profes- 
sional suburban community; however, there is one feeder school in a lower 
middle class neighborhood with subsidized housing. There are just over 1,400 
students, substantial numbers of whom go on to university after graduation. 
Mr. Lesley is a recently appointed principal to Smithson Collegiate, just 
beginning his second year. 

A district research study on student evaluation in calculus classes has just 
been completed and a summary sent to each secondary school, along with a 
memo from the assistant superintendent of program requesting that schools 
review the results and their implications. Asummary of the 80-page report is 
appended. 


The Issue 


When the results of the study were presented at a board meeting, there was 
considerable discussion. One influential and experienced trustee stated that 
he expected that the situation was even more serious among less experienced 
teachers and in more difficult to evaluate subject areas. 

Mr. Lesley sent this information to the math department head, Mr. 
Addison, who has held that position for 11 years, the past six at Smithson 
Collegiate. There are four calculus classes in the school, with four different 
teachers. The calculus course coordinator is a senior teacher with an excel- 
lent reputation among his peers for the quality of his teaching. The other 
three have several years of experience and are regarded as being competent. 
Mr. Addison told Mr. Lesley that he was not aware of any problems such as 
those mentioned in the report. Mr. Lesley asked him to raise the issue at the 
next department meeting and to keep him informed. The head agreed. 

All department meeting minutes are routinely sent to Mr. Lesley, and the 
next set for the math department meeting included an item on the calculus 
study. Mr. Lesley noted that after some discussion, the teachers involved felt 
that the problems reported were not significant at SCI because there was a 
common calculus exam and a policy of no exemptions. The department head 
appended a note saying that, although there were differences of opinion 
regarding teaching emphases, he felt strongly that the calculus classes were 
well taught by experienced staff members, each of whom had individual 
strengths, and that any attempts to impose greater uniformity would be 
counterproductive. 

A week later, an angry parent demanded to see Mr. Lesley about the 
calculus class his daughter was enrolled in. The teacher, he said, had a 
negative attitude toward all the students who weren’t especially talented in 
mathematics. His daughter was an “A” student who worked hard and had 
always achieved well, but was finding calculus difficult. The parent accused 
her teacher, Mrs. Peerless, of not having enough patience with those students 
who needed more time for practice and review of the concepts. The parent 
was indignant and demanded that his daughter be switched to another 
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calculus class taught by Mrs. Lovejoy, who had a reputation for being en- 
thusiastic about the subject and whose students always seemed to do well. 
You are Mr. Lesley. How will you handle the situation? 
Please think out loud as you consider this situation and your reactions to 
it. 
Report Summary 


The purpose of the report was to explore the meaning of grades assigned by 
different teachers of the same calculus course. The participants were 15 
teachers, one from each high school in the board. All participants were highly 
qualified and experienced teachers of the course. The data collected were as 
follows: 


1. a log giving the classroom activities each day and the time devoted to 
each; 

2. a list of the homework and seatwork assigned each day; 

3. alist of the criteria used to arrive at student grades for the course; and 

4. grades assigned by the teachers to a common set of 20 final exams ob- 
tained from a class not involved in the study. 


Great differences in practice were found. Among the teachers, the amount 
of homework and seatwork assigned varied greatly. When all activities listed 
in the logs were classified into one of six categories, variations in time spent 
in each activity were as follows: administration, from 0% to 5%; direct 
teaching, from 17% to 52%; student practice, from 8% to 47%; homework, 
from 11% to 43%; review, from 4% to 14%; and assessment, from 8% to 16%. 

Teachers who placed a relatively high emphasis on practice activities 
tended to place a relatively low emphasis on review and assessment ac- 
tivities. Moreover, those with more class time available expended a smaller 
percentage of time on direct instruction and allocated a greater percentage to 
homework and practice. 

From the reports of work assigned, the results showed that teachers 
varied widely in the number of questions assigned as homework—from under 
500 to more than 1,600. Substantial differences were also found among 
teachers in content emphasis. For example, the number of questions on basic 
skills ranged from 40% to 75% of the total number of assigned questions; the 
number on proofs from 0% to 18%. Similar variations were found in the other 
content groups. When content of what was taught and what was tested were 
compared, there were irregular patterns in topic emphases. The emphasis on 
basic skills was less on the tests than the assignments, whereas the emphasis 
on other content groups was greater on the tests than on the assignments. 

The analyses of the grading practices revealed large differences in the 
type of evidence collected by teachers for their grades. Examinations and 
term tests were the two main determinants of student grades. For the four 
classes (of 15) following an exemption policy, the majority of students took 
their only exam on material learned in the first half of the semester. For six 
other classes, the only exam was a final exam based on the entire semester's 
work. In the remaining classes, both a midterm and final exam were re- 
quired. Across teachers, the final examination mark was weighted from 15% 
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to 40% of the student’s final grade and, when a midcourse exam was ad- 
ministered, the resulting mark was weighted 9% to 30% of the final grade. 
Work in the first and second halves of the semester were weighted equally by 
five of the teachers, and the second half was weighted more heavily (70%) by 
the other 10. 

The study of teacher marking of the common set of 20 exams revealed 
widely different expectations of student performance for marks awarded, as 
seen in substantially different marking schemes for the same questions and 
in the existence of “easy” and “hard” markers. Although there was substan- 
tial agreement among the teachers as to the relative importance of the 
examination questions and the relative quality of the student papers, there 
was substantial disagreement as to the absolute quality of the student 
papers. 

To illustrate, three teachers assigned no paper a mark in the honors 
range, and one teacher assigned failing marks to seven papers; on the other 
hand, seven teachers assigned no paper a failing mark, and one teacher 
assigned marks of 80 or more to 10 of the 20 papers. 


Appendix B 
Value Statements 


Principal 1: Department—promote conformity; If not, they should be 
[common tests, exam], in other words one teacher sets the first test, another 
one sets the second one, and so on. 

Principal 2: Department—promote conformity; It [the school] should 
[have an overall evaluation policy]. Department—study-like data; That isn’t 
good enough [accept department’s assessment]. Department—promote con- 
formity; I don’t want one class doing 8% and another class 40%. 

Principal 3: Department—promote conformity; My particular philosophy 
is that parents and students have a right to feel that in one particular course 
where there are three or four or five different instructors that there is some 
continuity and some consistency of practice and evaluation. Department— 
promote conformity; While we are not trying to make robots of the teachers, 
we do feel that similar courses with similar evaluation practices should more 
or less resemble each other. Department—promote conformity; And while 
there will be some small difference in judgment, there is professional judg- 
ment, but that judgment should not be completely out of wack or based on 
entirely different tests that were given or different methods of evaluation. 

Principal 4: School—transfer info; I think, first of all, you undermine 
confidence in your teachers, period, if you’re willing to move them around 
like players on a chess board. Teacher—info; I think that’s a principal’s 
responsibility to see that the teachers’ doing what they’re supposed to be 
doing. Other; I’m not a believer in, first of all, undermining the teaching 
staff's strengths, period. Student—personal assistance; I am not a fan of 
personality conflicts. I don’t allow it ... It’s not something I encourage. 

District—system policy; We feel different about final exams than they 
[other schools in board] do and I think you have to allow a certain amount of 
flexibility, but there has to be some check. District—avoid system exams; You 
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don’t need examinations, maybe [in a school where focus is on business or 
tech] and you certainly don’t need standardized testing in calculus for the 
number of kids you are dealing with. District—avoid system exams; And I 
think you ought to keep that independence in a school. District—avoid 
system exams; I think you ought to be able to be able to address the needs of 
your kids. Other; The grade 9 and 10 teachers should be dealing with the 
elementary feeder schools. Department—promote conformity; And I think 
this common set of exams, again it [discrepancy] shouldn’t happen within a 
school. Department—promote conformity; There should be a marking 
scheme that’s common. Department—promote conformity; There should be 
a common exam for the four sections in the school, if there are four teachers. 

Principal 5: School—contain problem; And what you always want is to 
have the first avenue of appeal within the school. 

Principal 6: Department—promote conformity; My feeling is that I would 
expect to see a relatively high degree of commonality in such things as 
courses of study, expectations of students, evaluation techniques, textbook 
being used, what is the purpose of evaluation, what methods are used, how 
much sharing of evaluation techniques, right through the whole year. 

Principal 7: Other; I don’t think you’re going to get much support or much 
trust from teachers of any subject level working together to bring about 
betterment for the system if you’re going to compare one school to another, 
or for that matter, even teachers within that school. 

Principal 8: No value statements. 

Principal 9: Department—don’t promote; I don’t think the solution is to 
demand that the daughter be switched to another calculus teacher. District— 
system policy; The department heads across the county should have some 
broad general standards ... Department—study-like data; and the depart- 
ment head in this particular school, when slicing that calculus pie into four 
pieces, should have some internal way to monitor the stresses in the cour- 
ses—do we stress seatwork, do we stress the daily testing, do we have any 
assessment as opposed to evaluation, do we have make-up opportunities? ... 
master learning? ... chance to do [test] again? Department—promote con- 
formity; There should be some understanding between these people so that 
we eliminate many of these differences ... 

Principal 10: Other; I think you try and treat teachers ... treat them with 
respect. 

Principal 11: No value statements. 

Principal 12: Student; Whatever decision is made would have to be made 
in the best interest of the student, regardless of how the parent feels, and 
regardless of where the parent is coming from, and regardless really of what 
the teachers feel. 

Principal 13: Teacher—attitude info; Sometimes that [teacher not being 
sensitive] happens knowingly and that probably bothers me more than any- 
thing else. 

Principal 14: Department—promote conformity; I think in a common 
public educational system we need to be very very much aware of this and do 
whatever we possibly can to reduce that variance that could occur. Depart- 
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ment; You’ll never, I don’t think, get exact uniformity ’cause there are 
different teaching styles there. 

Principal 15: Other; I believe that no matter where we are we can con- 
tinue to improve and make things more effective. Department; I don’t feel 
that everybody has to teach lock-step, that we’re all doing all the same thing 
all of the time, because I think that certainly stymies individuality. Depart- 
ment—promote conformity; I also believe that there’s got to be a core, that 
we are doing basically a similar thing, especially with respect to evaluation ... 
Promote conformity; I think it’s only fair that there has to be a degree of 
continuity not only within a school but even within a system. Department— 
don’t promote; I believe that we can’t run away from a timetable from one 
person to the next. Student; I believe after thorough investigation if we have 
a personality conflict or a learning situation that is being stymied, then 
perhaps a change may be possible. 
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From a Horizontal to a Vertical Method 
of Integrating Educational Diagnosis 
with Classroom Assessment 


This paper is based on two Quebec studies which are part of an ongoing attempt to integrate 
educational diagnosis with class assessment. The underlying rationales, objectives, me- 
thods, and results of each research are described. Then, based on the limitations of the first 
study, as well as on new developments in the field of mastery learning, a mixed paradigm is 
outlined. It is argued that the unit of measurement should shift from the student to the 
teacher-student interaction, with the resultant proposal to change the methodology from a 
“horizontal” to a “vertical” one. 


In the early 1980s, the Ministry of Education of the Province of Quebec began 
to implement a new policy concerning the evaluation of students in the 
primary, secondary, and college levels. Inspired by the recent developments 
in formative evaluation and mastery learning (Birzea, 1982; Block, 1971; 
Bloom, 1982; Bloom, Madaus, & Hastings, 1981), it emphasized the impor- 
tance of student evaluation which became the primary responsibility of the 
teacher. 

The ensuing reform of the principles and programs guiding the system 
made it necessary for the teacher to collect more information regarding 
students’ learning and development and to do so more precisely than in the 
past. However, as this pedagogical reform was being gradually implemented, 
it became clear that the obvious problem of the quality of the information 
needed was compounded by the lack of integration of the “formative” tests 
with a global pedagogical strategy founded on the principles of formative 
evaluation. 

This article addresses the problem of the integration of the theory and 
practice in this domain by presenting the objectives, methods, and results of 
two studies conducted in different Quebec school boards over a four year 
period beginning in 1983. 

The common main goal of the two studies was to contribute to the 
elaboration of a model of diagnostic assessment destined to help guide the 
various educators (such as teachers, principals, and counselors) in the devel- 
opment of learning strategies based on formative evaluation. Each of the two 
studies focused on a specific research paradigm based on two different per- 
spectives of classroom assessment, one centering on diagnostic testing as 
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conceived by specialists, and the other focusing on classroom assessment as 
practiced by teachers. 

The first study, Elaboration et application d’un modéle de diagnostic 
pédagogique des difficultés d’apprentissage intégre un processus d’évalua- 
tion formative, focused essentially on assessing the efficiency of a heavily 
instrumented form of mastery learning approach. The second and current 
study, Elaboration d’un modéle d’intégration du diagnostic pédagogique aux 
apprentissages scolaires,” focuses on developing and validating a model of 
integration of educational diagnosis (diagnostic pédagogique) in the class- 
room setting. 

The objectives of the first research were: 


1. To analyze the different formative evaluation models; 

2. To analyze the different methods of test development specific to educa- 
tional diagnosis; 

To develop a model of evaluation of the students learning difficulties; 
To validate this model in a school setting; and 

To develop a measurement strategy necessary to apply the model ina 
school setting. 


Olea 


The second study, which is presently in its initial stage, aims at: 


1. Defining an effective research paradigm centered on the classroom; 

2. Reassessing the previous model in the context of the real curriculum as 
practiced in the classroom; 

3. Reformulating the model so that diagnostic assessment becomes more 
consistent with classroom constraints; 

4. Focusing on long-term effects of the model; and 

5. Exploring the feasibility of using classroom-based computerized diagnos- 
tic item banks. 


Conceptual Framework 


In both studies, formative evaluation is defined as any continuous evaluation 
practice intended to contribute to the improvement of learning in progress 
(Perrenoud, 1986). This definition has the advantage of not being limited to 
a given prescriptive model. In Canada, as in many other countries, we note in 
the past 20 years a growing number of official endorsements of the principle 
of formative evaluation. Whether in the form of published and widely dif- 
fused government regulations, as in Quebec for example (MEQ, 1979), or 
under different forms, the principle is widely recommended in policy state- 
ments. 

However, moving from intent to practice has proved quite difficult. This 
problem has been addressed abundantly both from a theoretical and a practi- 
cal point of view (Allal, 1983, 1988; Bloom,1984, 1988; Cardinet, 1984, 1988; 
Scallon, 1988). At the core of all the mastery learning models one finds 
“feedback-corrective procedures” loops. The resultant “diagnosis” is often 
confounded with the various pedagogical interventions used; it is therefore 
intimately linked to both the fundamental cognitive processes involved in 
learning and to the interventions themselves. Hence the meaning and indeed 
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the definition of diagnosis, reflect the complexity of any pedagogical inter- 
vention. 

In this article, we adopt an operational definition of educational diagnosis 
proposed in the context of computerized diagnostic assessment (Dassa, 1988) 
which allows one to take into account the applied nature of its integration to 
classroom assessment. In order to carry out an educational diagnosis one 
needs to: 


1. Identify the error, that is, in the present classroom context, what the 
teacher identifies as an incorrect response; 

2. Describe the nature of the error; 

3. Describe the pedagogical context; 

4. Propose a cause or a set of probable causes for the errors; 

5. Attempt to link the cause(s) to a learning paradigm; 

6. Propose corrective interventions coherent with both the teaching and 
technological approaches used and the classroom’s ecology; 

7. Assess the effectiveness of the corrective measures undertaken; and 

8. Decide whether further remediation is needed: if not, end the diagnosis; 
if yes, decide on a further course of action. 


The objectives of the first study were reached in three phases: 


1. Determination of the state of the problem by means of extensive litera- 
ture review and exploratory visits to numerous schools; 

2. Elaboration of a diagnostic model and of a pedagogical guide for 
teachers; this phase also included a first empirical validation involving 
trial runs of all pedagogical material and tests; 

3. Modification of the initial model according to the results obtained in the 
second phase; further validation of the model using a quasi-experimen- 
tal design. 


In this section we will focus mainly on the process of validation of the 
model (phase 3). Before presenting it in detail, we briefly summarize the 
model in its final form. As with most mastery learning models centering on 
corrective approaches, it is best described as an iterative process: 


1. Determine an adequate structural hierarchy of a given learning unit; 

2. Verify the affective and cognitive prerequisites; 

3. Describe and analyze the difficulties students have and attempt to iden- 
tify their causes; 

4. Apply appropriate corrective measures; 

5. Proceed with teaching according to a version of pédagogie de maitrise 
(mastery learning according to Bonboir, 1970 and Allal, Cardinet, & Per- 
renoud, 1979); 

6. Proceed with the next learning unit. 


Essentially, the final validation consisted of comparing within a school 
setting the effects of various levels of integration of the diagnostic to teach- 
ing. The choice of these levels was set by the results of phase 1. In short, it 
was found that most schools use the Ministry’s directives and documents in 
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an informal and uncontrolled way. Typically, the teacher is given lists of 
content-related objectives and then proceeds more or less as he or she wishes. 
In principle, teaching should take place according to the mastery learning 
approach as outlined in rather general terms in various Ministry documents 
available through the school board. Some teachers ignore these documents 
and teach in a traditional way. This situation characterized the “control” 
group. Two experimental conditions were then created to reflect: (a) the type 
of support the Ministry is considering offering (experimental 1); and (b) the 
maximum type of support that the Ministry, in conjunction with the local 
school board, would appear to be able to offer (experimental 2). 

The teachers in the experimental groups were given teaching modules 
custom-made by the research team with the collaboration of the school 
boards to fit their programs and academic calendars. These learning units 
(LU) were developed according to the following criteria: (a) representa- 
tiveness, the LU is a significant part of the program; (b) independence, the 
LU does not require highly specific prerequisites; (c) teachability, the LU 
should not present an unusual teaching problem; and (d) feasibility, the LU 
should be teachable within the normal school schedule. 

The learning units consisted of teachers’ books complete with a guide 
detailing the objectives; the rationale for the sequence of subunits (lessons); 
detailed directives for students on how to use the material, namely, a series 
of “activities” used in each subunit; and a series of diagnostic tests. The 
experimental groups were differentiated by asking the teachers of the ex- 
perimental 1 group to interpret the diagnostic tests by themselves, whereas 
the teachers of the experimental 2 group were given personal detailed in- 
structions on the use of the diagnostic tests as well as all the assistance they 
required in interpreting and using their results. 

The study was limited to grade 6; at this level Quebec schools tend to 
emphasize the consolidation of previously acquired topics. This contributed 
to increasing the independence and teachability of the LUs. Two LUs were 
especially developed for the experiment, one in French (as mother tongue) on 
homonyms and one in mathematics on the multiplication of numbers with 
decimal points. 

Basically, the experiment was framed as a classical quasi-experimental 
design with repeated measures obtained on a large number of variables. The 
data originate from 33 of 36 grade 6 classes of the school board of Saint 
Jerome, a large town 50 km north of Montreal. The classes, comprising 854 
students and 33 teachers, were assigned to 3 groups corresponding to the 
three levels of experimentation. The unit of assignment was the class and the 
system of random allocation controlled two factors: the location of the school 
(downtown or rural) and the socioeconomic status of the students attending 
the school (based on a socioeconomic study conducted by the school board). 

All students were given a battery of pretests and posttests in mathematics 
and in French, a mathematics test on the prerequisite basic skills, a self-con- 
cept scale, and attitude scales for French and mathematics teaching. In 
addition, the students of the two experimental groups were given 10 diagnos- 
tic tests in mathematics and 6 in French; each lesson was followed by a short 
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paper-and-pencil quiz or by an oral activity (games, questions and answers, 
etc.). These tests covered the material of the 10 subunits in mathematics and 
the 17 (shorter) subunits in French. The experiment lasted about a month 
during which time the research team was present throughout the day in 
Saint Jerome mainly to respond to any request for assistance expressed by 
the teachers of the experimental 2 group and to keep track of relevant 
information pertaining to each class. 

At the end of the experiment all participating teachers responded to a 
questionnaire. It focused on: (a) detailed reports on their use of pedagogical 
material and evaluation procedure used, (b) their reported degree of satisfac- 
tion with material and procedure, (c) their recommendation on material and 
evaluation procedure, and (d) their attitudes toward the different compon- 
ents of formative evaluation. 

A univariate analysis of variance was performed separately on each vari- 
able of the pretests and posttests as well as on gain scores. This was followed 
by an analysis of covariance. The interactions between the affective and 
cognitive variables were investigated by a two-factor hierarchical design. 
Finally, the teachers’ responses to the questionnaire were first analyzed in a 
descriptive way. These data were then combined with the information ob- 
tained by direct contact with the teachers during the experiment. 


Summary of Results 


It was found that the experimental groups generally presented higher 
achievement in French and in mathematics. Furthermore, the experimental 
2 group was significantly different (higher) than the experimental 1 group in 
mathematics. In all cases there was a significant reduction of individual 
differences in the achievement variables for the experimental groups, as 
opposed to a nonsignificant (very slight) reduction for the control group. The 
effect size indices (Glass, McGaw, & Smith, 1981, p. 210) obtained for the 
experimental groups were equal to 0.56 for both French and mathematics. 

The students’ attitude had improved significantly only in the experimen- 
tal 2 group and only in French. No significant correlation between attitude 
and academic achievement was found. In general, the LUs were well per- 
ceived by 70% of the teachers and students alike. However, about 30% of the 
teachers thought that the time required by the LUs was excessive. A majority 
(about 70%) of teachers thought that formative evaluation is useful and 
accepted the pedagogical consequences: verifying prerequisites, identifying 
difficulties, and remediating. However, it is worth noting that about 60% of 
the teachers thought that learning difficulties do not depend on the quality of 
teaching. Furthermore, about 25% of the respondents disagreed with the 
necessity for more personalized teaching. 

Finally, the teachers of the experimental 1 group (who were not given 
personal detailed instructions on the use of the diagnostic tests) reported 
toward the end of the experimental period a change of attitude among some 
students with respect to the mode of testing. The gradual disenchantment of 
the students with the testing session, for both French and mathematics, 
seemed to express a rejection of the paper-and-pencil technology: Comments 
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of the type “Oh! not tests ... again!” seemed, according to some teachers, to be 
the typical way of greeting a session. Though the experimental 2 classes as a 
whole did not report such a “rejection” (to quote a teacher of experimental 
group 1), some of them, typically those who did not vary their instruction 
strategies, reported a gradual disenchantment of the students with the test- 
ing sessions. 


Moving from Horizontal to Vertical Strategy 


The limitations of Study 1 stem from a number of interrelated factors. The 
first is substantive in nature and is particular to all mastery learning models: 
Because the experiment lasted a little over four weeks, one should raise the 
question of the long-term effect of the application of the model studied. 
However, it should be noted that our study, which showed a 0.58 standard 
deviation effect, fared relatively well among the more comparable ones 
reported by Slavin (1987) in his meta-analysis of mastery learning research. 

The second limiting factor concerns the method used. Study 1 was framed 
as a heavily instrumented traditional quasi-experiment based on the largest 
sample of subjects possible in order to ensure an adequate representativeness 
of the population of subjects. As a trade-off for generalizability with respect 
to the subjects, a severe limitation was put on the nature and scope of the 
curriculum as well as on the information pertaining to the teaching and 
evaluation naturally occurring in the classroom, the “real” curriculum. 

The third and more general factor is epistemological in nature. Because 
the feedback-remediation loop is so central to the formative evaluation of 
learning in a teaching context, the errors made by the students have to be 
placed in their proper context. First, let us recall that an error is in practice 
defined as such essentially by the teacher. Hence errors express, at least 
partially, the type of relationship to knowledge that is gradually being built 
for the student. That is why corrective measures, in order to be effective, 
need to be related to the dynamic description of errors as well as to their 
fundamental underpinnings (statut de l’erreur). The corresponding diag- 
nosis is therefore closely linked to the way the regulatory loop is put into 
practice or interpreted by the teachers, the school system, and for that 
matter by the researcher. 

Essentially, the typical mastery learning approach starts with the for- 
mulation of objectives so specific that a large number of them are needed in 
order to satisfy curricular requirements. Cardinet (1988) has recently criti- 
cized the resultant “atomistic” evaluation approach. Even though the so- 
called “intermediate” (as opposed to “terminal”) objectives used in some 
school systems, such as in Quebec, may reduce the problem of inflation of 
objectives, the fact is that their choice remains essentially arbitrary. Neither 
the inflation nor the reduction of the objectives seems to be effective: When 
learning is measured precisely, then the number of objectives is typically very 
large and their choice is arbitrary; however, when a small number of terminal 
objectives are formulated, they tend to be too general to lend themselves to 
measurement. 
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A similar contradiction exists for setting the mastery criteria. Assessing 
an achievement as mastered or not mastered refers to the difficulty of 
correctly interpreting the results, of attributing a meaning to a given perfor- 
mance and thus of creating the proper frame of reference in terms of at- 
tempting to identify the cause of the difficulty experienced by the student. 
Whatever the reality of the evaluation performed, it is atomized in myriad 
measured entities, supposedly forming the basis of a diagnosis independent 
of the learning context and, a fortiori, from the classroom ecology. 

Again, Cardinet (1988) showed how the prerequisites are, in this mecha- 
nistic interpretation of the regulatory loop, subject to the same criticism. This 
strikes at the core of the main operational device used in mastery learning: a 
sequence of objectives defining a hierarchy which is at best only suitable for 
certain types of learning. The nonlinearity implied by many language cur- 
ricula as well as the need to restructure or construct certain mathematical 
representations are well known. 

Furthermore, the diagnosis becomes problematic when conceived as an 
entity by itself, without any tie to the classroom. Indeed, its causal com- 
ponent could in some cases be useless. A simple example illustrates this point. 
In the same way as one could cure a bad case of flu by staying in bed and 
drinking plenty of fluids without ever identifying, or even attempting to 
identify, the nature or the physiological mechanism of the viral infection in 
order to cure it, a teacher could correct a difficulty by just repeating, and 
eventually changing, the didactic approach used. Of course, in some cases 
remediation without identifying the causes of the difficulties is next to impos- 
sible. Needless to say, persisting in this search raises the problem of allocat- 
ing available resources in view of the gravity of the difficulty. 

Finally, the problem of an unequivocal causal interpretation is not resolv- 
ed by logically analyzing a given content of study, or even by doing so in 
reference to a specific learning theory. As noted by Cardinet (1988), a clear 
symptom (such as a difficulty with carrying numbers in arithmetic opera- 
tions) may hide a subtler conceptual confusion (such as the function of 
numeration of position). When cut from its classroom context and from a 
broader subject matter theory, the simplistic sequence of “error, causal 
interpretation, and specific correction” seems to operate in a vacuum, be- 
cause it excludes the interaction among the cognitive, affective, and social 
problems as they occur between the teacher and the student in the context of 
the classroom. 


Method of Study 2: The Vertical Approach 


From the previous section, it seems that corrective approaches stand a better 
chance of being effective, and the resultant learning of being closer to op- 
timality, if regulation (as well as its ensuing measurement components) is 
conceived within the classroom context and in collaboration with the teach- 
ers. A new conception of regulation is thus needed, one that will more fully 
take into account the relevant affective variables pertaining to the teachers 
as well as to the students, one that will link the cognitive, affective, and social 
components of learning difficulties to the teachers’ practices. With respect to 
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this conception of regulation, a new trend has appeared in Europe (compare, 
for example, Allal, 1988; Huberman, 1988). It proposes a less technological 
approach to evaluation, a more interactive and qualitative approach more in 
line with psychosocial analyses (compare, for example, Perrenoud, 1988). In 
North America, the need to tackle the evaluation of school achievements in a 
novel way, less dependent on testing and based on more thorough under- 
standing of the classroom considered as a microworld, has led to recent 
developments in the way classroom assessment is studied (compare, for 
example, Stiggins & Bridgeford, 1985; Stiggins, 1988). 

The second study discussed in this article may be situated at the junction 
of these two trends. The method used is based on a mixed paradigm (Van der 
Maren & Painchaud, 1984; Miles & Huberman, 1984). The nature and 
volume of the information required necessitated a compromise between the 
horizontal strategy retained for Study 1 (maximizing the number of stu- 
dents) and a more vertical approach aimed at maximizing the information 
from the basic unit of observation: the classroom. 

The study is currently in its preliminary phase. Conceived as an ex- 
ploratory research taking place in a high school of the Saint Jerome school 
board, its main specific objectives (related to the first objectives of the first 
study) can be summarized as follows: (a) to identify the relevant variables 
involved in managing the information necessary to classroom assessment; 
(b) to develop (or adapt) the instrumentation needed to assess the teachers 
perception, expectations, and practices related to classroom assessment; (c) 
to explore the feasibility and generalization of a model of on-site teacher 
training aimed at developing autonomy in performing diagnostic assess- 
ment. 

A group of six Secondary II (second year of high school) mathematics 
teachers is currently participating in the study. After an initial period of 
negotiation and consultation with the school board, the head of the school, 
and the participating teachers, a model of intervention was elaborated and 
agreed on. Rather than proceeding from an initial a priori theoretical posi- 
tion, this method results from the confrontation of school constraints to the 
theoretical requirements of formative evaluation. Basically, the training 
centers on the preparation of an LU that will require about three weeks to 
teach (from mid-April to mid-May 1989). 

The teachers are currently spending all their available time (about three 
hours every week), with the guidance of the researchers, elaborating peda- 
gogical documents and tests, namely, interpreting the relevant terminal and 
intermediate objectives formulated by the Ministry of Education, formulat- 
ing the domain of reference, devising a potentially useful (according to them) 
hierarchy in terms of the educational diagnosis, deciding on the didactic 
means to be used, elaborating their own assessment tools, clarifying the 
diagnostic interpretation that they plan to perform, and describing the re- 
medial actions that they intend to implement. 

The research team has already proceeded to a rather large collection of 
information relative to (a) the teachers’ fundamental beliefs and attitudes 
regarding evaluation in general and classroom assessment in particular; (b) 
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the teachers’ perceptions and expectations regarding evaluation practices in 
their own classroom, in their own school, and in the system at large; (c) the 
teachers’ practices. The first two types of information were gathered by 
means of interviews and preliminary versions of beliefs and attitude scales; 
also, a comprehensive questionnaire is under development. Observation 
grids to be used in the classroom are being developed in an effort to better 
understand the way teachers really practice formative evaluation. 


Concluding Remarks 


The first study, conceived in the traditional positivist mode of investigation, 
yielded some interesting results concerning the efficiency of mastery learn- 
ing. It showed that in the best possible case, making the most with pedagogi- 
cal documents and tests prepared by outsiders, although custom made, had 
serious limitations. Even if it is unclear whether the sigma improvement 
would have persisted over a longer period of experimentation, the study did 
show, within the limitations discussed by Slavin (1987), that it pays to 
remediate difficulties. However, it also presented some intrinsic limitations 
which were discussed earlier. A new mixed approach has been developed to 
attempt to improve even further the efficiency of the regulation process, 
exemplified by the retroactive loop of mastery learning. Changing the focus 
of the study from the student as a statistical unit to the student-teacher 
relationship (one is tempted to say partnership) requires new research tools. 
The vertical approach, briefly described in this article, is presently being 
refined in the school. Its qualitative aspects, especially as seen by the par- 
ticipating teachers, bear some similarity to action research. By this, we mean 
that researchers and teachers alike are engaged in an ongoing, mutually 
agreed on exchange of information: The teachers are being trained and, in a 
way, the researchers too. However, the latter part of the exchange is only 
temporary and is due to the exploratory nature of the preliminary phase of 
the study. It is also genuinely confined by theoretical considerations relative 
to educational diagnosis and to its integration in a general framework of 
research. Once the present exploratory study ends, the second and third 
objectives described earlier should be pursued in a way that allows for gen- 
eralization of the framework and its reformulation into a model. 


Notes 
1. Ajar, D., Dassa, C., and Scallon, G. Elaboration et application d’un modéle de diagnostic 
pédagogique des difficultés d’apprentissage intégre a un processus d’evaluation formative. 
FCAC-Quebec grant (1982-1985). 
2. Dassa, C. Elaboration d’un modéle d’intégration du diagnostic pédagogique aux 
apprentissages scolaires. SSHRC-Canada grant 410-88-0886. 
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Measurement and Evaluation Practices 
of British Columbia Science Teachers! 


The 1986 British Columbia Science Assessment surveyed science teachers on many aspects 
of teaching practices, including measurement and evaluation practices. Science teachers at 
all grades depend most heavily on their own objective-type tests and activity write-ups for 
measurement information on which to base student evaluations. Oral measurement devices 
decline in use as students progress through the grades. Although the four goals of attitudes, 
skills and processes, knowledge, and critical thinking are equally important in the science 
curriculum, the emphasis in measurement and evaluation is on the knowledge goal, with 
attitudes being virtually ignored. 


During the 1985-1986 school year the third assessment of science was under- 
taken in British Columbia. As part of that assessment, British Columbia 
science teachers were asked to respond to questionnaires about various 
facets of science teaching. Included in the questionnaires were items which 
dealt with measurement and evaluation practices. This article reports and 
discusses the results of those questions. 

The subjects responding to the questionnaires were all practicing science 
teachers of grades 4, 7, and 10 in British Columbia schools. A 20% random 
sample of returned questionnaires at each grade level was drawn from the 
total populations for the purpose of detailed analysis. Table 1 reports the 
numbers of teachers in each grade level sample and their background data. 

The ratio of male to female teachers at grade 4 is almost balanced, but as 
one progresses up the grades the ratio becomes unbalanced. By grade 10, over 
90% of the teachers in the sample were male. In general, the teachers in the 
study were mature and well experienced in teaching, with over half the 
teachers having more than 10 years in the classroom. 

All teachers in grades 4 and 7 were asked a question regarding the degree 
of emphasis they placed on certain types of measurement devices when 
deciding on a final evaluation for their students. The question was “How 
much emphasis do you place on each of the following in deriving a final 
evaluation for your students in science at this grade level?” Table 2 displays 
the teachers’ responses to this question. The Weighted Use column of Table 
2 was achieved by multiplying the percentages of teachers responding in the 
Little Use, Some Use, and Much Use categories by 1, 2, and 3 respectively, 
adding, and then dividing by 100. 

Teachers were also provided the opportunity to suggest different types of 
measurement devices they may have used other than those suggested. None 
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Table 1 
Background Description of Teachers in the Study 


Percentage of Teachers 


Item Category Grade 4 Grade 7 Grade 10 
Number 321 328 324 
Gender Male 51.1 Thee 90.1 
Female 48.9 24.3 9.9 
Age 24 or less 1.0 1.6 0.6 
25-29 10.5 8.8 7.7 
30-34 17.5 17.9 15.8 
35-39 25.4 28.0 26.0 
40-49 33.3 36.8 39.0 
50 or more 12.4 6.9 10.8 
Years of Teaching 1-2 4.4 Sf, 4.6 
3-5 14.8 9.2 8.0 
6-10 20.2 22:5 20.4 
11-15 27.1 27.4 26.9 
over 15 36.4 ee 40.1 


was supplied. As can be seen from the table, teachers rely most heavily on 
their own objective-type tests, with activity/experiment write-ups and 
projects also receiving fairly heavy emphasis. Reports on science topics, 
anecdotal records, and subjective-type tests are the other devices which are 
used to a considerable extent. The only significant differences between grade 
4 and 7 teachers were that grade 4 teachers use anecdotal records and oral 
tests to a higher degree than do grade 7 teachers. In addition, grade 7 
teachers place more emphasis on activity/experiment write-ups. In general, 
however, grade 4 and 7 teachers are very much alike in the emphasis they 
place on particular measurement techniques for the purposes of gathering 
data to evaluate their students in science. 

One surprising result was the lack of use of the Ministry of Education 
Classroom Achievement Tests. There are two tests at each grade level 
(Science Processes and Science Knowledge) available free in class sets to 
teachers in British Columbia. These tests are reliable, multiple-choice tests of 
40 items each, with each item being referenced to the British Columbia 
elementary science curriculum. They are accompanied by teachers’ manuals 
which include a section on standards set by practicing teachers in British 
Columbia. One can only speculate on why these tests do not receive more use 
given the teachers’ reported heavy reliance on objective tests. Perhaps the 
teachers are not aware of their existence, in which case the Ministry should 
publicize them more. On the other hand, perhaps the confrontational at- 
titude of “teachers versus the Ministry,” which has existed in British Colum- 
bia for some time, has stimulated teachers to avoid anything to do with the 
Ministry, in which case perhaps distribution of the tests should be through 
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Table 2 
Emphasis on Types of Measurement in Deriving Final Evaluation of Students 
by Grade 4 and 7 Teachers (Percentage of Teachers) 


No (0) Little (1) Some (2) Much(3) Weighted 
Type of Measurement Emphasis — Emphasis Emphasis Emphasis Use (Rank) 
Teacher-Made Objective Tests 
Grade 4 1.0 2.9 48.1 48.1 2.43 (1) 
Grade 7 0.0 2.2 43.3 54.4 2.52 (1) 
Projects 
Grade 4 3.9 9.3 70.1 16.7 2.00 (2) 
Grade 7 4.1 13.8 63.5 18.6 1.97 (3) 
Activity/Experiment Write-ups 
Grade 4 Se 16.4 62.3 16.1 1.89 (3) 
Grade 7 4.1 a tai 57.6 ire 2.08 (2) 
Reports on Topics in Science 
Grade 4 6.1 14.5 68.1 Iaiso 1.85 (4) 
Grade 7 6.0 16.6 63.6 13.8 1.85 (4) 
Anecdotal Records 
Grade 4 Ta 23.6 58.3 10.8 1:72.15) 
Grade 7 15.8 31.8 46.2 6.2 1.43 (6) 
Subjective Tests (essay, paragraph, etc.) 
Grade 4 13.8 Pa lee 57,6 6.9 1.58 (6) 
Grade 7 9.3 22:8 57.6 10.3 1.69 (5) 
Oral Tests 
Grade 4 35.0 27.6 S2uf, 4.8 1.07 (7) 
Grade 7 47.4 rod | 19.9 aay, 0.76 (8) 
Individual Work Contracts 
Grade 4 52.6 12.8 28.1 6.6 0.89 (8) 
Grade 7 54.1 15.0 23.5 7.5 0.85 (7) 
Student Self-Reports 
Grade 4 51.6 20.8 22ie 5.4 0.81 (9) 
Grade 7 54.3 21.2 22.5 2.0 0.72 (9) 
Attendance 
Grade 4 67.3 Ree 13.6 5.9 0.58 (10) 
Grade 7 66.8 17.4 9.4 6.4 0.55 (11) 
Ministry of Education Classroom Achievement Tests 
Grade 4 62.0 215 15.1 1.4 0.56 (11) 
Grade 7 58.2 22.4 18.4 1.0 0.62 (10) 


another agency such as the British Columbia Teachers’ Federation lesson 
aids service. 

Grade 10 teachers were asked a similar question and the results are 
reported in Table 3. Their question was, “How much importance do you place 
on each of the following in deriving a final evaluation for your students in 
Science 10?” The Weighted Use column in Table 3 was obtained by multiply- 
ing the proportion of teachers responding to the Important and Very Impor- 
tant categories by 1 and 2 respectively, adding, and dividing by 100. 
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Table 3 
Importance of Types of Measurement in Deriving Final Evaluation of Students 
by Grade 10 Teachers (Percentage of Teachers) 


Not (0) (1) Very (2) Weighted 

Type of Measurement Important Important Important Use (Rank) 
Teacher-Made Objective Tests a3 28.3 70.5 1.69 (1) 
Laboratory Write-ups 3.1 59.4 37.4 1.34 (2) 
Subjective Tests 13.6 562 30.2 1707 (3) 
Reports on Topics 19.7 63.5 16.8 0.97 (4) 
Projects 21.7 61.2 17.1 0.95 (5) 
Attendance 33.2 50.0 16.8 0.84 (6) 
Anecdotal Records of 

Work Habits 29.0 63.0 7.9 O.79 947) 
Anecdotal Records of 

Achievement 41.4 39.1 19.5 0.78 (8) 
Classroom Behavior 41.7 43.3 15.0 0.73 (9) 
Standardized Objective Tests 43.0 42.7 14.3 0.71 (10) 
Anecdotal Records of General 

Attitude in Class 47.7 48.3 4.0 0.56 (11) 
Individual Work Contracts 72.3 ae) 6.4 0.34 (12) 
Oral Tests 70.3 Zone 4.1 0.34 (13) 
Student Self Reports 74.5 22.6 2.8 0.28 (14) 


A similar pattern of importance to that seen from elementary teachers is 
apparent. There is a heavy reliance on teacher-made objective-type tests with 
laboratory write-ups also being important. Subjective tests appear to be more 
important at the secondary level than they are at elementary. These three 
devices were the only ones that received a mean importance of more than 1 
(1=Important); however, reports and projects are also relatively highly con- 
sidered. Attendance and classroom behavior become more important and 
oral tests become less important as the grade level increases. As was the case 
for elementary teachers, we see little reliance on standardized objective-type 
tests, although again there are several Ministry of Education Classroom 
Achievement Tests available free to teachers. 

The Junior Secondary Science Curriculum Guide and Resource Book 
(British Columbia Ministry of Education, 1985) lists four goals for the junior 
secondary program: attitudes, skills and processes, knowledge, and higher 
level or critical thinking skills (creative, critical, and formal or abstract 
thinking). Further, it states that “all four goals should receive equal em- 
phasis throughout the junior secondary science program” (p. 5). Grade 10 
teachers were asked the question, “In the determination of the final letter 
grade for your students, what emphasis should be placed on each of the 
following?” Table 4 reports the results of this question. 

As can be seen from Table 4, the goals are certainly not equally weighted 
when it comes to evaluation of students. It is possible that teachers are not 
aware that the goals are to be equally weighted. However, given the experi- 
ence of teachers and the extent of inservice provided for implementation of 
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Table 4 
Emphasis on the Various Goals for Final Evaluation by Grade 10 Teachers 


(Percentage of Teachers) 


Receives About 


Mean 
Goal 0% 10% 20% 30% 40% 50% 60% Weight 
Attitudes 22.05) 105,00) 119.0 2.4 0.7 0.0 0.0 10.5% 
Skills Processes 06. 10:2)" 30:5" 4471 tot 2.5 0.0 26.4% 
Knowledge 0.0 0.6 C9 © 23:1. Oeeeet oon 19.0 Mees 1:9%5 
Critical Thinking Of33.) 22.3 ee 40.0 oir 6.1 1.3 tO Pie2).5% 


the curriculum, this explanation for the lack of equal weight is improbable. It 
seems more probable that teachers are more familiar with, more comfort- 
able with, and trust the techniques for measuring knowledge more than they 
do those for measuring other areas. Skills and processes can be measured 
through laboratory work, and Table 3 shows that teachers use laboratory 
write-ups to a considerable extent. On the other hand, the measurement 
techniques for assessing attitudes and critical thinking are, from my experi- 
ence talking to teachers, neither familiar to teachers nor trusted by them. As 
such, it is probable that the lack of equivalence of the goals in final evalua- 
tions is at least partially due to a lack of trusted measurement information on 
these goals. Teachers need to be provided with good sample instruments and 
the skills and techniques necessary to develop, administer, and interpret 
measurement devices designed to assess areas other than the cognitive 
domain of learning. 

Grade 10 teachers were asked whether they personally had the sole 
responsibility for the final evaluation of students or if some other measure- 
ments (school-wide or district-wide tests) played a part in determining a 
student’s final letter grade. About two thirds of the teachers indicated that 
they had either or both a school-wide and/or district-wide test, while about 
one third said they had the sole responsibility. They were then asked what 
their preference would be as far as external examinations were concerned 
and what weight should be given to an external exam. Table 5 reports the 
results of this question. 

In general, teachers are much more favorable toward a school-wide exam 
than to a district-wide or provincial examination, with only between one 
quarter and one fifth of the teachers agreeing with the latter two. Given that 
teachers put little emphasis on standardized objective examinations as dis- 
played in Table 3, this is an expected finding. If external examinations were 
to come into play, the most popular weight would be about 20% which, from 
a measurement perspective, is not unreasonable. 

The teachers were also asked two questions regarding the distribution of 
marks they assigned to students. The first question is whether they used a 
preset distribution of marks for their science students. Over one third of the 
teachers responded yes, they did use a preset distribution of marks! This 
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Table 5 
Grade 10 Teachers’ Preferences for External Examinations 
(Percentage of Teachers) 


Yes, worth: 
Type of Measure No 10% 20% 30% 40% 50% 
School-Wide Exam 20.6 8.2 50.4 12.8 3.9 4.3 
District-Wide Exam 74.5 9.6 11.6 2.4 0.4 1.6 
Provincial Exam 77.1 Be 8.5 4.8 TA 3.0 


finding is alarming given the extreme differences in ability and achievement 
that occur from class to class, from school to school, and from year to year. 

The second question asked teachers to provide their expectations of mark 
distributions for their science classes. Thirteen percent of teachers said that 
they had no expectations for any given class. The responses from the remain- 
ing 87% are illustrated in Table 6. 

The Mean column was obtained by multiplying the percentage of teachers 
by the midpoint of the range, adding, and dividing by the total percentage of 
teachers responding. The resulting expected distribution of letter grades 
appears to be very traditional and conforms quite closely to the reported 
results over the last few years for English 12, the only government ex- 
aminable subject in Grade 12 that all students must take. 

The results of this study contain few surprises, although the lack of use of 
available, curriculum-referenced, quality objective tests is a finding which 
should cause some concern. Teachers in British Columbia are probably not 
different from teachers in other provinces. They receive little training in 
evaluation in their preservice education, often only part of one course, and 


Table 6 
Grade 10 Teachers’ Expectations of Student Letter-Grades 
(Percentage of Teachers) 


Percentage of Students 


Letter Grade 0-5% 6-15% 16-30% 31-50% 51-100% Mean 
A (Excellent) 56.5 37.7 4.5 1.0 0.3 7.9% 
B (Very Good) 5.8 59.7 31.4 3.1 0.0 10.5% 
C+ (Satisfactory) ae 28.4 60.3 7.4 0.7 23.1% 
C (Satisfactory) ie 15.2 55.3 26.2 2.1 29.8% 
P (Pass) 11.4 44.8 31.0 10.0 2.8 20.6% 
F (Fail) 57.4 35.2 6.0 tat 0.4 8.2% 
No Expectations: irec.shi worth eesitec piece: 13%y &eciup ale, ete oe 
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few teachers elect measurement and/or evaluation as a component of their 
ongoing inservice education. As McLean (1985) points out, evaluation is 
more a craft than a profession; teachers measure and evaluate more through 
“folk knowledge” than from a sound theoretical and practical base (pp. 
32-34). At present, teachers appear to depend heavily on evaluation in the 
cognitive domain, an area rich in folk knowledge. Other areas are severely 
underutilized. The theoretical knowledge and practical techniques for 
measuring all domains of education exist. However, some method must be 
developed to provide teachers with good examples of measurement instru- 
ments and with the existing knowledge and the practical skills necessary to 
use alternate, appropriate measurement and evaluation techniques for the 
assessment of particularly the affective and psychomotor domains. The 
award-winning publication Evaluation Techniques: A Handbook for 
Teachers (British Columbia Primary Teachers’Association, 1985) has been 
well received and has had considerable impact on primary teachers in British 
Columbia. It might serve as one model to be considered. Perhaps an extended 
document containing concepts such as those expressed by Cornett (1982) 
could be produced for all grade levels. In any case, the challenge to the 
Canadian measurement community is to have the knowledge and techniques 
that exist put into practice. 


Note 


1. The data used in this study were collected under contract to the British Columbia 
Ministry of Education and were first reported in the British Columbia Science 
Assessment Reports (Bateson, Anderson, Dale, McConnell, & Rutherford, 1986a; 
Bateson, Anderson, Dale, McConnell, & Rutherford, 1986b). The author is grateful for 
their permission and assistance. 


References 


Bateson, D.J., Anderson, J.O., Dale, T., McConnell, V., & Rutherford, C. (1986a). The 1986 
British Columbia science assessment: General report. Victoria: British Columbia Ministry 
of Education. 

Bateson, D.J., Anderson, J.O., Dale, T., McConnell, V., & Rutherford, C. (1986b). The 1986 
British Columbia science assessment: Summary report. Victoria: British Columbia 
Ministry of Education. 

British Columbia Ministry of Education. (1985). Junior secondary science curriculum guide 
and resource book. Victoria: Province of British Columbia. 

British Columbia Primary Teachers’ Association. (1985). Evaluation techniques: A handbook 
for teachers, Vancouver: British Columbia Primary Teachers Association. 

Cornett, J.D. (1982, September) Alternatives to paper-and-pencil testing. NASSP Bulletin, 
66, 44-46. 

McLean, L.D. (1985). The craft of student evaluation in Canada. Toronto: Canadian 
Education Association. 


51 


The Alberta Journal of Educational Research Vol. XXXVI, No. 1, March 1990, 52-64 


W. TODD ROGERS 


University of Alberta 


Current Educational Climate in 
Relation to Testing 


As in other jurisdictions, there is strong support for testing and assessment in British 
Columbia. Taken as a whole, the data and information obtained in the Let’s Talk About 
Schools study and the Royal Commission on Education, completed respectively in 1985 and 
1988, suggest that there is far more support for new and expanded examination, assessment, 
and evaluation programs. Despite this apparent popularity, members of the measurement 
and evaluation community must be mindful of the concerns raised by those who question the 
“overuse” of tests and continually monitor the way tests are developed and used. 


Communiqué 
Regina, September 27, 1988 FOR IMMEDIATE RELEASE 
EDUCATION MINISTERS DISCUSS NATIONAL ISSUES 


Provincial and territorial ministers responsible for education concluded a two-day 
meeting of the Council of Ministers of Education, Canada (CMEC), in Regina, Sas- 
katchewan, today. 


The ministers agreed that provinces will coordinate the assessment of the quality and 
effectiveness of their education systems. A proposal for a three-phase implementation 
plan was approved. The assessment will examine participation and graduation rates 
and, in the final phase, a means will be developed for assessing achievement levels of 
13 year olds across the country in literacy and numeracy. 


PROVINCE OF BRITISH COLUMBIA NEWS RELEASE 
MINISTRY OF EDUCATION FOR IMMEDIATE RELEASE 
NEWS RELEASE #06-89 DATE: JANUARY 27, 1989 


MAJOR CHANGES TO EDUCATION ANNOUNCED 
Victoria, B.C.—Minister of Education Tony Brummet today announced major new 
policy directions for education in British Columbia. 


Brummet reported that government has accepted the vast majority of the recommen- 
dations of a recent Royal Commission of Education headed by Barry Sullivan... 


Sullivan’s report pointed out that if our education system is to maintain its relevance 
and prepare our children for the future, it must change to meet these new challenges. 


Major changes announced by Brummet include a new provincial curriculum structure 
that will be phased into schools over the next ten years.... 


Expanded assessment and accountability systems aimed at monitoring the progress of 
individual students and reporting to the public will also be developed. The Grade 12 
provincial examination system will be expanded.... 


Todd Rogers is a professor of educational psychology. His research interests are 
measurement, evaluation, and research methods. 
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The above quotations reflect the growing concern with the quality of 
schools and the future of those who attend schools, the students. Briefly, 
parents concerned with the progress of their own children and educators 
responsible for the provision and delivery of quality educational programs 
express uneasiness about the ability of today’s schools to adequately meet the 
needs of students and to prepare students for what can best be described as a 
rapidly changing technological future. Governments, in response to these 
concerns, are directing increasing proportions of their budgets toward educa- 
tion and its improvement. 

That calls for new and expanded testing have been made is predictable 
and reflects what might best be called a theme for greater accountability. 
Such a theme is made evident in public and professional support for stand- 
ards and examinations and in a desire to monitor student performance. 
Clearly there is a belief among the public and professionals taken as a whole 
that standards and testing are useful ways to monitor and evaluate school 
effectiveness and, thereby, to ensure that accountability with respect to 
quality exists within the system. ! Politicians responsible for allocating funds 
for education want to know if they are getting their money’s worth and turn 
to testing as the way to find out. 

Every large education reform of the past few years has called for either 
new tests or expanded use of existing ones. In Britain, for example, new 
national tests are a key tool in the government’s effort to enact far-reaching 
changes in that nation’s education system. Central to this reform is a unified 
national curriculum backed by a series of tests administered to children at 
the ages of 7, 11, 14, and 16. In its recent deliberations about the bill to 
reauthorize federal elementary and secondary education programs, the 
United States Senate added an amendment that calls for a substantial 
expansion and reshaping of the federal program that measures educational 
progress, the National Assessment of Educational Progress. The stated pur- 
pose of the Senate amendment was to improve the effectiveness of schools by 
making objective information about student performance in a variety of 
learning areas (reading, writing, mathematics, science, history, geography, 
and civics) available to policymakers at various levels. 

This purpose closely parallels the statements of purpose that can be found 
in the rash of legislation mandating new or expanded testing enacted in 
various states over the last five or so years in response to the somewhat 
scathing indictment of United States schools by the National Commission on 
Excellence in Education (1983). Petrie (1987) summarized this situation by 
indicating that it is not “too much of an exaggeration to say that evaluation 
and testing have become the engine for implementing educational policy” (p. 
175). 

The call by the British Columbia Minister of Education (Brummet, 1989) 
for increased assessment and testing as part of the new curriculum structure 
to be implemented over the next 10 years is yet another example of this 
trend. The pertinent policy statements and their rationales are: 


1. The regular Grade 12 provincial examination system will be expanded, and student 
performance on Grade 12 scholarship examinations will be given greater recognition. 
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It is important that students have the opportunity to demonstrate their competence, 
in comparison to provincial standards, for as many areas as possible of the provincial 
curriculum which they have studied in their final year of school. Outstanding scholar- 
ship must also be recognized more consistently. 


Where they do not now exist, regular provincial examinations will be developed for 
Grade 12 courses in the Humanities area of the current provincial curriculum. As the 
new provincial curriculum is developed, all Grade 12 courses in the Humanities and 
Sciences areas will have provincial examinations. 


Students will be asked to demonstrate their competence in the other areas of the new 
provincial curriculum, Practical and Fine Arts, prior to graduation. 


Effective in the 1989-90 school year, provincial examinations will be given a 40% 
weighting in determining a student’s final course mark in examinable Grade 12 cour- 
ses. This is a reduction from the present system in which provincial examinations are 
given a 50% weighting. The change reflects a greater emphasis on teachers’ assess- 
ment of the classroom performance of students in their final year of school. 


2. There will be increased emphasis on regular assessment of student performance 
and evaluation of program effectiveness. 


Vitally important to the continual improvement of student programs is careful 
monitoring of how well students are learning in comparison to the intended outcomes 
of instruction. Current efforts of the system in this regard need to be expanded. 


A variety of new assessment and program evaluation procedures will be developed for 
use at the classroom, school, and provincial levels. These procedures will facilitate 
comparing program results to target levels of performance and provincial standards. 


The current Provincial Learning Assessment Program will be expanded and adjusted 
to focus on the objectives of the new provincial curriculum. 


As an additional measure of program effectiveness, the province will continue to take 
advantage of opportunities to participate with other provinces and countries in com- 
parison studies of student performance. 


To provide relevant information efficiently for purposes of monitoring outcomes at all 


levels of the school system, a new student data collection and processing system will 
be implemented in cooperation with districts and schools. 


3. Accountability and reporting procedures at all levels of the system will be improved. 


Students need to be informed about their strengths as well as about areas that re- 
quire further work. Parents, the local community and the general public expect, and 
have a right to receive, information about the operation of the system on a regular 
basis and in an understandable form. The strengths of the current reporting proce- 
dures need to be developed further. 


Students will be provided with continuous feedback from teachers about the skills 
they have mastered and about those that require further development. 


Parents will receive frequent and clearly understandable reports from schools on stu- 
dent progress and competence. 


The community will be provided with annual reports from the school board on the per- 
formance of schools in the district, in relation to provincial and district objectives and 
standards. The reports will reference key indicators, based on information provided 
by the Ministry. 


The Legislative Assembly and the people of the province will receive annual reports 
from the Ministry showing the progress made toward achieving the goals of education 
and the desired attributes, as outlined in the mandate for the system. (pp. 14-16) 


Further evidence of the increased role assessment and evaluation are to 
play in the new provincial curriculum can be seen in the ungraded program 
for the first four years of schooling. During the first of these years (formerly 
kindergarten), 


54 


Current Educational Climate in Relation to Testing 


the individual child’s developmental needs can be assessed and appropriate program 

adjustments made.... Schools will determine the program for each child based upon 

the child’s needs, local priorities, and the capabilities of the school. Children can 

move forward at their own pace. Some students may complete the program in less 

than four years, and some may take longer. (Brummet, 1989, pri2) 

To underscore the importance of these assessment and evaluation initia- 
tives, the Ministry set September 1989 as the deadline for completing the 
planning necessary for the implementation of new and expanded assessment 
and examination procedures. 

Analysis of the Ministry’s assessment and evaluation policy suggests that 
evaluation and testing will play instrumental roles in the new provincial 
program to be implemented in British Columbia. This policy reflects what 
Petrie (1987) has called the engine of educational policy and is in agreement 
with testing policies in other jurisdictions like Great Britain where major 
educational reform is taking place. But is there support for such a role in 
British Columbia? 

Generally, the answer is yes. Support comes from two major educational 
studies completed in British Columbia during the last four years: Let’s Talk 
About Schools (Ministry of Education, 1985) and the Royal Commission on 
Education (Sullivan, 1988). 


Let’s Talk About Schools 


The intent of Let’s Talk About Schools (Ministry of Education, 1985) was to 
provide a valid and comprehensive portrayal of British Columbians’ opinions 
about and preferences for their schools. Unlike a Royal Commission, no 
recommendations were to be formulated. 

This study began in July 1984 when the then Minister of Education 
assembled an Advisory Committee to prepare a background paper which 
would help stimulate public and professional discussion of some of the major 
issues and questions facing schools now and in the years ahead. Of the 41 
discussion questions formulated by the Committee, three organized under 
the heading Standards of Achievement in Schools were related to assessment 
and evaluation: 


1. What standards should be established for all schools, public and other- 
wise? Should standards be established for all grades and all subjects? 
Who should establish these standards? Should standards be monitored? 
How? By whom? 

2. To what extent should province-wide examinations exist? What pur- 
poses can such examinations serve? 

3. To what extent should entrance examinations set by postsecondary in- 
stitutions determine secondary school curricula and standards? (Minis- 
try of Education, 1985, Booklet 5, p. 12) 


Data Base 

February 1 to mid-March 1985 was the period set aside for public and 
professional discussion of the 41 questions plus any other issues that par- 
ticipants wished to raise. Altogether 3,525 individual responses from parents 
and members of the public, 969 individual responses from education profes- 
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sionals (teachers, administers, school trustees, teacher educators), 482 let- 
ters, 316 group submissions, and 10 petitions (signed by a total of 5,110 
people) were received. During this same period, 567 meetings with an es- 
timated total attendance of 15,575 were held throughout the province. 


Analysis 

The coding system developed to analyze responses was generated from the 
answers provided by the respondents to the discussion questions. For each of 
the 41 questions, the statements made were grouped according to common 
elements. Themes were then extracted and designated as “coding com- 
ponents.” Codes reflecting the variation within each component were then 
formulated. For example, analysis of the answers given in response to Ques- 
tion 2: “To what extent should province-wide examinations exist? What 
purposes can such examinations serve?” led to the identification of seven 
components. The first addressed the issue “Should there be province-wide 
examinations?” The corresponding codes were 1=yes, 2=no, and 3=partly. 
The second component, with its 14 codes, captured the functions and pur- 
poses held by the respondents for these tests. The third component with its 
four categories was necessary to reflect the negative consequences of testing 
identified by respondents opposed to province-wide examinations. Compon- 
ents 4 and 5 picked up those respondents who identified the grade(s) and/or 
subject(s) for province-wide testing. Some respondents proposed qualifica- 
tions regarding the types of students to be tested, thus the need for a sixth 
component. Finally, it was necessary to introduce a seventh component to 
capture statements made about the degree to which province-wide examina- 
tions should count toward final or year-end grades. 

Table 1 presents a summary of the coded responses to the three achieve- 
ment standards questions. The first column lists the components with their 
codes. The next two columns contain the percentages of parents and general 
public (column 2) and education professionals (column 3) for each of the 
codes. The corresponding percentages for meetings, letter writers, and 
groups appear in columns 4 to 6. In all cases the percentages are based on the 
number of respondents, N, coded in each component. Percentage values are 
not reported for the code “Other” which includes responses made by fewer 
than 5% of the respondents for that component. 


Findings 

As shown in Table 1, for the most part the component percentage distribu- 
tions across the five reporting groups are comparable. In order to simplify 
reporting in the presentation of highlights which follow, the term respondent 


is used rather than separately listing the five samples in these cases. Where 
differences occur, the samples are identified by name. 


1. What standards ... ? 

More than 90% of the respondents agreed that achievement standards 
should be established for all forms of schooling. Approximately half favored 
provincial standards, while national standards were preferred by one third of 
the general public; by one fifth of the education professionals, letter writers, 
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Table 1 
Standards of Achievement in Schools 


GENERAL EDUCATIONAL MEETINGS LETTER GROUPS 
PUBLIC PROFESSIONALS WRITERS 
RESPONSE DOCUMENTS RESPONSE DOCUMENTS 

1 What standards should be established 
for all schools, public and otherwise? 
Should standards be established for all 
grades and all subjects? Who should 
establish these standards? Should 
standards be monitored? How? By whom? 

1.1 Should standards be established for all N = 2514 N — 676 N = 344 N — 90 N = 91 
schools, public or otherwise? = sae REN ARR 5 re, =e 
Yes 95.0 925 98.0 94.4 96.7 
No 2.0 27 06 11 22 
Other (public only, private only, not possible 

to set standards) 

1.2. What should be the reference group for — N=1022 N -. 247 "N=178 N = 42 N= 50 
standards? Se ele a 4, 3 eee Be 
National 348 186 33.7 238 18.0 
Provincial 53.6 50.6 57.9 50.0 48.0 
District 35 134 6.7 24 60 
School 49 73 22 0.0 10.0 
Individual Student 12:7 22.7 8.4 190 32.0 
Other Suggestions (international, classroom) 

1.3 Which grades should standards apply to? ~=3=39N=1106 N - 218 oP NS 1020 N24 oi. pan 
All Grades oc 62. dae 674 oreGra 625 a a 75.0 
Elementary 65 87 98 125 83 
Junior Secondary 6.0 73 88 125 56 
Senior Secondary 91 165 11.8 125 56 

1.4 Which subjects should standards apply to? N = 1074 N - 212 N = 95 N- 23 N = 34 
All Subjects 59.0 443 316 30.4 i wesaarl. 
Core Subjects 38.5 524 62.1 609 412 
Other Subjects (special education subjects, 

vocational subyects) 

1.5 What is the basis for the standards? N = 935 N . 254 N = 133 N ~ 56 “N= 50 
Average i 6.1 BS) , 15 18 eating 0.0 “os 
Higher Than Average 84 TA 68 ait 2.0 
Mastery (when specifically stated) 14.7 14.2 12.8 89 40 
Minimum Competence (functionally literate) 229 248 24.1 16.1 46.0 
Honours/Pass/Fail 49 55 6.0 54 6.0 
Excellence (highest possible) 335 Sit 45.1 66.1 40.0 
Other (system based on percentages, guide 

lines only, present system Is fine as is) 

1.6 Who should be involved in setting standards? N= 1548 N = 399 "N= 118 j N - 31 N = 57 
“Set. Consu.* Set. Consu Set. Consu. Set. Consu. i. “Set. Consu. 

Province (Ministry of Education) 530 1.8) 446 40 44.1 1.7 613 00 509 35 
School Board (district, central administration) Wee) pei} 188 45 93 76 32 00 193 105 
Professionals (schools, BCTF, teachers, 

principals, experts. school administrators) Ss 6r 7.0 383 11.0 322 119 194 65 263 193 
Parents 108 48 90 35 68 59 65: sam 172 193 
Broad-based Committee (committee with mixed 

range of representatives) 56 04 Hbshe Tels) 85. 08 32 1010 53 50.0 


Other Suggestions (Provincial specialist 
associations, local specialist associations, 
private schools set own. pupils) 


*NOTE: Set, — Set the standards, Consu. Consultant (0 assist in sett) standads ee 

1.7. Should standards be monitored? N = 1311 N - 301 N = 93 N - 39 N = 56 
Yes “O71 96.0 meroze 897 96.4 
Other (no, not really, teachers should be 

monitored) 

1.8  Howshould they be monitored? N = 763 N 162 N= 51 N. 19 N = 34 
Government Exams “ae Ucs4) = 586 58.8 684 47.1 
Teacher Tests (exams, tests, unspecified) 28.7 30.9 45.1 263 412 

Classroom Observation (class visits, general 

assessment) 10.2 148 78 53 235 
Learning Assessments (Canadian Achievement 

Tests, Provincial Learning Assessment 

Program) : 83 ee} 3.9 158 17.6 

1.9 By whom should they be monitored? N = 970 N . 235 N=50 N — 21 N= 40 

Provincial Officials (Ministry) 445 396 40.0 619 60.0 
District Officials (Superintendent, district : 

administrators) 13.1 22.1 22.0 48 125 
Schoo! Board 203 170 10,0 238 175 
Principals (school administrators) 11.0 ley 20.0 95 10.0 
Teachers (BCTF) ee 289 26.0 333 30.0 
Independent Committee (broad based 

committee) 72 55 8.0 00 78 

13.0 Wd 40 95 5.0 
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Table 1 (continued) 


2 To what extent should Province wide 
examinations exist? What purposes can 
such examinations serve? 
: : = 5 = N = 68 N = 76 
2.1. Should there be Province-wide N = 2459 Ni = 675 aatcee™ 2) 
examinations? ee 
765 683 89.6 735 TI6 
Yes 145 
No 149 190 5.0 19.1 
A 74 79 
Qualified yes (“Yes, but ...”) 7.0 113 43 
Other (uncertain, not sure) 
2.2 The following purposes were suggested * = 
by the respondents who favoured (yes and = 1647 N — 379 N = 150 2 N= 41 N = 52 
qualified yes) Province-wide testing: = zs “=a 
Use with Standards (way to see if standards 385 
and guidelines are being met) 45.7 433 42.0 61.0 
Normative Comparisons (how we stand relative 49 38 
to others) 58 82 10.0 i 
Ensure Uniformity (quarantee equality of 
education throughout the Province/districts, 
same education for all in B.C.) 20.0 12.4 22.7 24.4 sa 
Ensure the Curriculum is Being Followed 11k?) 12.1 12.0 12.2 le 
Measure Achievement 129 148 14.0 195 
Provide Feedback at Student Level 85 13.5 133 73 23.1 
Assist with Student Placement 10.9 11.4 53 98 173 
Assist Evaluate/Monitor Students 20.6 16.6 10.6 195 25.0 
Assist Evaluate/Monitor Teachers 20.1 145 173 Sh le7/ 154 
Assist Evaluate/Monitor Schools 86 55 33 49 1.9 
Other Purposes (provide feedback to parents, 
provide feedback to employers, evaluate/ 
monitor principals and district staff) 
2.3 The following reasons were given by the 
respondents who opposed Province-wide N = 186 N = 74 
examinations: —_— ———_ . = 
Inhibit Learning (an ineffective way of testing, 
too much stress on students, negative effect 
on testing practices) 48.4 55.4 Insufficient Insufficient Insufficient 
Teachers Teach to Test 26.9 37.8 slid My canbe be ; suede ree 
respondents to respondents to 
Expensive/Waste of Time 323 25.7 conduct Lae conduct 
District/School Responsibility 10.8 12.2 analysis. analysis analysis. 
2.4 Which grade should be tested? N = 838 N= 211 N = 102 N= 14 N = 36 
Grade 1 232 123 0.0 00 0.0 
Grade 2 23.6 123 21.6 00 0.0 
Grade 3 272 18.0 245 EA 305 
Grade 4 273) 190 32.4 00 25.0 
Grade 5 25.7 14.7 fas}is} 00 25.0 
Grade 6 31.6 20.4 28.4 ad 27.7 
Grade 7 414 30.8 46.1 214 36.1 
Grade 8 42.4 28.0 46.1 143 36.1 
Grade 9 48.9 27.0 51.0 21.4 38.8 
Grade 10 56.0 A417 64.7 21.4 416 
Grade 11 575 408 56.9 35.7 50.0 
Grade 12 90.7 91.0 90.2 TIA 88.8 
2.5 Which subjects should be examined? N=38 #$N=@3 me to ae 
All Subjects 222 140 "O94 rer) ae) insufficient 0.0 ‘ 
number of 
Academic and Core Subjects 69.3 818 67.6 respondents to 100.0 
Other (special education, vocational) conduct 
analysis 
SS SSS 
2.6 How should the examinations count toward N = 401 N = 156 N = 107 N = 20 
a final grade? How should they be weighted? ~~ $$ ee Se 
Less Than 50% 40.1 55.1 495 Insufficient 50.0 
50% 9.7 90 122 aie of 0.0 
Mo respondents to 
re Than 50% 6.0 32 5.6 conduct 0.0 
Partial Only 30.4 199 46 analysis. 25.0 
Not Final (do not count as part of final mark 
at all) 10.0 7 28.0 25.0 
SSD 
3 To what extent should entrance require 


ments set by post-secondary institutions 
determine secondary school curricula 


and standards? 

3.1 Should post-secondary entrance require- 
ments determine secondary curricula N= 1714 N = 447 N=91 N = 30 N = 42 
and standards? = ee ee 
Yes 78.9 68.9 70.4 833 T1A4 
No 183 286 26.4 16.7 28.6 


Other (uncertain, not sure) 
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and groups; and by one third of the meetings. Approximately 20% of the 
education professionals and 30% of the groups indicated that standards 
should be set at the individual level (Ministry of Education, 1985, 1.1, 1.2). 

A majority of respondents indicated that standards should be established 
for all grades. They were less certain about which subjects should have 
standards. Six out of 10 meetings, letter writers, and groups favored re- 
stricting standards to core subjects, while six out of 10 public respondents 
suggested standards should be applied to all subjects. Education profes- 
sionals were more evenly split (1.3, 1.4). 

Generally, respondents preferred maximum performance over minimum 
competency (1.5). 

Preferences of the respondents concerning who should establish stand- 
ards were, in order, provincial officials, education professionals, and school 
board trustees (1.6). 

Respondents wanted the standards monitored and suggested that this 
responsibility be given, in order of preference, to provincial officials, teachers 
and district officials, and school board members. Provincial examinations 
and classroom tests were preferred methods for such monitoring (1.7, 1.8, 


Car 


2. To what extent should province-wide ...? 

Approximately 80% of the general public, education professionals, letter 
writers and groups, as well as 90% of the meetings, called for province-wide 
examinations. Of these respondents, more than 90% advocated Provincial 
testing at Grade 12. Further, beginning with Grade 1, the percentages tended 
to increase from one grade to the next, with marked increases at the grade 
levels corresponding to the end of elementary and junior secondary schools. 
Overall, the public placed greater emphasis on testing than did education 
professionals (2.1, 2.4). 

The most frequently cited reasons given for provincial exams by their 
supporters were to uphold standards; to ensure uniformity among districts, 
to measure, evaluate, and monitor student achievement; and to evaluate and 
monitor teacher performance. Those opposed to province-wide tests stated 
that such tests “inhibit learning,” “lead teachers to teach to the tests,” and 
“are expensive and a waste of time” (2.2, 2.3). 

Slightly more than two thirds of the general public and meetings, 82% of 
the professionals, and all groups suggested provincial tests should be ad- 
ministered in academic and core subjects. Of those who commented on the 
degree to which province-wide tests should count toward final grades, the 
majority indicated less than 50% (2.5, 2.6). 


3. To what extent should entrance ...? 

Approximately 80% of the public and letter writers felt postsecondary 
entrance requirements should determine secondary school curricula and 
standards. About 70% of the education professionals and groups also en- 
dorsed this view (3.1). 
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Royal Commission on Education 


The Royal Commission study of British Columbia schools (Sullivan, 1988) 
was initiated in March 1987 and completed in July 1988. Its tasks were to: 


1. Report on the state of British Columbia schools in both the public and 
nonpublic sectors; 

2. Identify areas of difficulty or concern as part of an assessment of the 
present and future needs of the system; 

3. Identify the kinds of preparation programs school graduates will require 
to meet the challenges of the next two decades; 

4. Determine what school programs and administrative processes should 
be implemented to deal with current and emerging problems; and 

5. Recommend some responsible paths of action based on comprehensive 
understandings of the system’s characteristics, priorities, and the social 
and economic contexts in which schools operate (p. 4). 


Data Base 


During the period set aside for data and information gathering (May 1987 
through March 1988), the Commissioner and his staff visited 139 schools in 
89 communities, participated in 23 student assemblies, and held 66 public 
hearings. In addition to these public meetings, 54 meetings were held with 
teachers alone. In all, 2,099 written and 231 oral submissions were received. 
Over the course of the data collection period, additional meetings were held 
with representatives of government agencies and the major provincial educa- 
tional organizations to clarify points made in written submissions received 
beforehand. In addition to the information presented in the oral and written 
submissions, the Commission sought and obtained information from a vari- 
ety of educational reports from other provinces and from jurisdictions out- 
side of Canada, as well as from journal articles, government reports and 
commissioned position papers, and research conducted by Commission staff. 

In light of concerns with parents’ rights and responsibilities, the preschool 
experience of children, and the articulation between school and work, three 
working groups were appointed to advise the Commission on these issues. 
The Commission was similarly concerned that the views of students be 
heard, especially those just about to complete school. Accordingly, a student 
task force consisting of 54 senior secondary students from throughout the 
province was established for this purpose. 

During the course of the Commission, the Commissioner was assisted by 
SIx research teams within six broad areas related to education in the pro- 
vince. Each team systematically analyzed all the information described 
above. As well, the team members attended public hearings, visited schools, 
participated in conferences, reviewed research articles and reports pertinent 
to their own areas, consulted experts, commissioned their own position pap- 
ers, and conducted their own research. At the completion of its work, each 
team prepared a technical report containing the results of its analysis and a 
series of suggestions designed to close the gap between “what is” and “what 
ought to be.” 
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Following submission of the six team reports, representatives from each 
team systematically analyzed all reports to develop, in concert, an integrated 
set of suggestions for consideration. This process resulted in the formulation 
of acommon set which served as the primary input to the final reeommenda- 
tions presented in the Commissioner’s report. 


Findings Related to Testing, Assessment, and Evaluation 


In contrast to Let’s Talk About Schools, numerical summaries were not 
reported in the Royal Commission report. Instead, summary statements 
were made, supported by anecdotal information gleaned from the submis- 
sions made to the Commission. 

With respect to testing, assessment, and evaluation, the Grade 12 Provin- 
cial Examination Program attracted the greatest attention of the general 
public, educational professionals, and students, similar to the situation for 
Let’s Talk About Schools (see Table 1). Written by all grade 12 students who 
wish to graduate, these examinations were reintroduced in British Columbia 
in 1983, after having been discarded in the early 1970s. The decision to 
reintroduce provincial examinations was essentially a political decision. 
Three reasons were advanced by the then Minister of Education: 

Firstly, our government believes strongly that students graduating from our high 

schools should be achieving challenging standards that are consistent across the 

province.... Secondly, a graduate applying for admission to a college or university 

should be assured of being treated equitably in the admissions process and be assured 

that high school marks will be taken at face value.... Finally, we introduce testing to 

reassure the public that we are serious about quality in our schools. (Henrich, 1983, 

p. 59) 

No pressure was exerted from within the system to reintroduce Grade 12 
“government” examinations. Indeed, many teachers and other educators at 
the time expressed opposition to their reintroduction (see Heinrich, 1983). 
That resistance has gradually waned; despite the controversy surrounding 
the reintroduction of these examinations, “the majority of students, parents, 
teachers of Grade 12 subjects, and the public wish them to be continued” 
(Marx & Grieve, 1988, p. 103). One particularly enthusiastic supporter said 
that “provincial exams were one of the best things to happen to B.C.” 

Not everyone, though, was supportive of the examination program. One 
student commented that, “The present system for provincially administered 
final exams is an unrealistic measure of the student’s true skill and is valued 
far too highly in terms of the student’s final mark.” An anonymous brief 
stated categorically: “Get rid of government exams. They’re destroying 
teaching programs and have too much weight.” 

One recurring issue with respect to the grade 12 examinations raised by 
supporters as well as opponents of the program concerned the weight given 
to the government exam in the final grade for a course. A number of recom- 
mendations were made to change the current division—50% teacher mark, 
50% government exam—to accord more weight to the school mark. The most 
common recommended relative weighting was 60-40. Interestingly, during 
the time of the Royal Commission, the Student Assessment Branch, the 
division within the Ministry of Education responsible for the Grade 12 ex- 
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amination program, was reexamining the weighting issue in light of the Let’s 
Talk About Schools finding that the majority of respondents felt that the 
50% weighting given to the examinations was excessive. The Branch con- 
cluded its analysis with the recommendation that the current weighting be 
changed to 60-40. The same suggestion was made by the Commission’s 
curriculum research team (Robitaille, Oberg, Overgaard, & McBurney, 1988, 
p. 49). However, following its analysis of learning and teaching functions and 
activities, the Commission’s research team responsible for the learners ar- 
gued that “the learners of British Columbia would be better served if the 
Provincial Exams were eliminated” (Marx & Grieve, 1988, p. 103) and 
replaced with an “emphasis on classroom practices that use criterion-refer- 
enced evaluation” (p. 106). 

Following discussion leading to an integrated set of suggestions, the 
Commission recommended: 


That provincial Grade 12 examinations be extended to all subject areas and that 

those examinations count for one-third of the student’s Grade 12 marks. (Sullivan, 

1988, p. 108) 

Few comments were received by the Commission concerning the Learn- 
ing Assessment Program and the province’s participation in international 
studies. It seems respondents were more aware of and prepared to comment 
on issues related to “high stakes” examination programs which yield scores 
for individual students. They appeared to be either less aware of, or perhaps 
less concerned with, “lower stakes” programs which yield aggregated group 
scores. However, the curriculum research team (Robitaille et al., 1988, pp. 
24-25) suggested that the purpose and procedures of the assessment program 
be more clearly tied to curriculum revision, and that sampling and testing 
techniques be employed in order to obtain system-level data on students’ 
performance in the basic subject areas on an annual basis (p. 48). The 
Commission accepted these suggestions and recommended: 

That the goals of the Provincial Learning Assessment Program be reformulated to 


concentrate on the provision of annual data on student achievements (knowledge, 
abilities, attitudes) in each of the basic subject areas. (Sullivan, 1988, p. 111) 


Summary 


Taken as a whole, the data and information obtained in the Let’s Talk About 
Schools study and by the Royal Commission on Education suggest that there 
is far more support for than opposition to the policy directions announced by 
the Minister of Education on January 27 toward new and expanded examina- 
tion, assessment, and evaluation programs. More than 90% of public and 
professional respondents to the Let’s Talk About Schools discussion held that 
standards should be set for all forms of schooling, including home schooling 
and schools that operate outside the public system. Eighty-two percent of the 
public and 67% of the educators who expressed their views indicated that 
standards should apply to all grades. Further, public and professional endor- 
sement of testing climbed steadily grade by grade, with more than 90% of 
respondents supporting the use of Grade 12 provincial examinations (Table 
1, Item 2.4). There is clearly a belief among both the public and professionals 
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that standards and testing are useful ways to monitor and evaluate school 
effectiveness and, thereby, to ensure that quality exists within the system. 

Further evidence of support for the Minister’s policy directions can be 
inferred from the reactions to the Royal Commission. Concurrent with the 
release of the Commission’s report, the Minister of Education appointed an 
Education Policy and Advisory committee composed of representatives of the 
major education professional organizations and community organizations 
with interests in education. The purpose of this Committee was to advise the 
Minister regarding the implementation of the Commission’s recommenda- 
tions. To assess further reactions to the Commission’s recommendations, the 
Minister met with student, parent, teacher, principal, and trustee groups in 
35 school districts located throughout the province. The consultation process 
demonstrated that “there is strong support for the great majority of the 
Commission’s recommendations” (Brummet, 1989, p. 7). Of particular rel- 
evance here is the observation that with the exception of the Commission’s 
recommendation that Grade 12 examinations count for one third of a stu- 
dent’s Grade 12 grade, all other recommendations related to testing, assess- 
ment, and evaluation were accepted. 

Despite this strong support for testing and assessment in British Colum- 
bia, the testing community in this province as elsewhere (see Madaus, 1985), 
must continually monitor the way in which tests are developed and used. We 
should remain mindful of the concerns raised by those who question the 
“overuse” of tests. For example, we saw two of the advisory research teams 
to the Royal Commission disagree on the continued use of Grade 12 govern- 
ment examinations. This apparent contradiction is centered on the wide- 
spread concerns that the content of the examinations has unduly influenced 
the curriculum and that teachers teach only what is considered likely to be 
tested. In the same vein, tests are frequently criticized for apparently focus- 
ing only on that which is easy to test: recall of factual information, applica- 
tion of routine algorithmic procedures, and the like. We need to be aware of 
such concerns. We must be prepared to give the consumers the protection 
they deserve by being mindful of testing, assessment, and evaluation as 
practiced and to reassert the fallibility of measurement even in the face of 
those who are obsessed with the infallibility of numbers.°® 


Notes 


1. While the majority of educators believe that standards and testing are useful in 
monitoring school effectiveness and quality, they are concerned about testing practices 
which may impact negatively on students’ academic growth and self regard. 

2. The six areas were Schools and Society, Learners, Curriculum, Teachers, Finance, and 
Governance and Administration. 

3. Toward this end, the British Columbia Ministry of Education commissioned in March 
1989 a study team to assess by March 1990 the impact of the Grade 12 program on 
British Columbia’s educational system. 
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Toward Improving Assessment of Students 
with Special Needs: Expanding the Data 
Base to Include Classroom Performance 


In this paper, two traditional methods of assessment (the medical model and the psycho- 
educational model) are reviewed with the objective of illustrating the need to expand the 
data base included in assessment procedures. It is argued that there are at least two errors 
of measurement associated with traditional assessment practices; error due to the focus on 
a single measure or group of measures and error due to the restricted sample usually 
associated with these assessment practices. Following this review, a model of authentic 
measurement ts introduced. In implementing this assessment model, a stronger emphasis is 
given to collecting descriptive classroom-based information over time and to evaluating all 
obtained data more critically in terms of what youngsters are required to do during 
assessment. Finally, it is argued that in making these changes, more effective instructional 
decisions can be made for students with special needs. 


From the perspective of the special educator, measuring the classroom per- 
formance of normally achieving students is relatively straightforward. Such 
students are fairly consistent in their classroom performance. Thus any 
estimate of classroom achievement is a reasonable approximation of what 
has been learned to date. Further, when faced with some form of classroom 
assessment the majority of normally achieving students do not get upset or 
refuse to cooperate. They tend not to have a lengthy history of negative 
experience with assessment that must be overcome. In addition, normally 
achieving students typically do not have difficulty reading or listening to 
instructions. Such students usually possess sufficient skills so that (a) they 
know how to prepare for most classroom tests, (b) they can prepare at least a 
general test-time utilization plan, and (c) they are capable of making sense 
out of test items. Students with special needs often lack such test preparation 
and completion skills. (The term students with special needs will be used in 
the majority of cases throughout the remainder of this article for the sake of 
brevity. It should be remembered, however, that the term special needs 
normally is used to describe a wide range of students but is restricted here to 
students with mild to severe learning and/or emotional difficulties.) 
Obtaining valid measures of classroom performance is equally important 
for students with special needs as it is for normally achieving students. Two 
main problems remain to be overcome before the abilities of students with 
special needs can be described fairly and usefully. The first is how to incor- 
porate classroom data into existent diagnostic-prescriptive models. The 
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second is how to obtain technically adequate classroom-based diagnostic 
information. 

A few assessment techniques, appropriate for either classroom or clinical- 
diagnostic use, are described. The advantage of these techniques is that they 
can be used to assess students who are achieving normally or students who 
have special needs. In addition, the use of the suggested assessment techni- 
ques is incorporated into an educational decision making model. The proce- 
dures discussed have been derived from over 15 years of classroom and 
clinical experience with youngsters who have displayed learning and/or emo- 
tional handicaps. Before explaining this method, I review clinical-diagnostic 
assessment practices critically and give‘a rationale for changing assessment 
practices for students with special needs. 


Current Clinical-Diagnostic Assessment and 
Instructional Practice 


Psychoeducational assessment currently is the most widely practiced model 
of clinical-diagnostic or diagnostic-prescriptive assessment. It has emerged 
as the dominant model of assessment practice in a variety of facilities, 
ranging from neighborhood schools to specialized clinics and units of child- 
ren’s hospitals (e.g., Forness et al., 1980; Sattler, 1988; Ysseldyke et al., 
1983). Although the model is implemented in slightly different manners as a 
function of the youngsters being assessed, most typically a school psycholog- 
ist administers and scores a series of individual standardized tests. Inter- 
pretations of the test performance of students with special needs are then 
based on test score patterns or discrepancies between test and subtest scores 
(Hessler, 1986; Reynold, 1985). 

When proposed, this approach was a departure from the predominant 
medical model. In the latter strategy, as might be inferred from the name, the 
purpose was to isolate causal explanations of poor performance, which were 
often assumed to be specific to an individual’s constitutional makeup. Advo- 
cates of the psychoeducational model called for a shift from causal explana- 
tions to correlational hypotheses. The new assessment goal is to define a 
series of factors (related to how youngsters learn and behave) that contribute 
to lack of success in school. 

As the psychoeducational model has been implemented, however, educa- 
tors have come to depend on other individuals to obtain much of the informa- 
tion about students with special needs. Most frequently psychologists are 
requested to administer an assessment battery and to prepare a report in 
which students’ learning patterns or personality are described. Extensive use 
has been made of norm-referenced measurement (Morrison, 1984; Smith, 
1982; Stone, Cundick, & Swanson, 1988). Measures administered have in- 
cluded individually administered intelligence, neuro-psychological, and be- 
havior-personality tests. Depending on the reason the youngster has been 
referred for assessment, other specialists may be involved to check, for 
example, degree of visual impairment or hearing loss. The primary objective 
is first to classify youngsters into special education categories, such as men- 
tally handicapped or learning disabled, and then to determine an appropriate 
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education placement (Forness & Cantwell, 1982; Garrett & Brazil, 1979). 
Instructional planning follows after the student has been assigned to a 
classroom. 

Accompanying the change from a medical model of assessment to a 
psychoeducational one have been shifts in the special education service del- 
ivery system. One of the most significant was the addition of a generalist 
school-based special educator: the learning resource teacher (Dunn, 1973; 
Hardy, 1977). The introduction of an assessment-instructional specialist in 
the home school has had ramifications for the assessment process. Learning 
resource teachers now have the role of assisting youngsters in the regular 
classroom. When first introduced, they typically started with a referral from 
the regular classroom teacher. This consisted of descriptive information 
about the student and his or her present problem or problems. The detail and 
precision of such obtained referrals were a function of at least two factors: (a) 
the classroom teachers’ time to and skill in preparing referral information 
and (b) the type of information on which the teachers were basing their 
referrals. The learning resource teachers then would assess each youngster. 
Subsequently, instruction would be provided to small groups of students with 
similar difficulties, the students being removed temporarily from the regular 
classroom during this instructional period. 

Thus a modification emerged in the practice of the psychoeducational 
model of assessment. The resource teacher assumed command of educational 
assessments, and the psychologist retained responsibility for the psycho- 
educational assessment used to determine program placement. In addition, a 
new layer of service delivery resulted, in which the learning resource teacher 
worked in cooperation with the classroom teacher to provide school-based 
instruction. 


Challenges to the Psychoeducational Model 


Although the application of the psychoeducational assessment model with 
students with special needs has been an improvement over the causal at- 
tributions resulting from the use of the medical model, its use now is being 
reexamined (Algozzine & Korinek, 1985; Gelzheiser, 1987). Two challenges 
have emerged recently. First, a challenge to the service delivery system and 
to the categorical system of classifying youngsters is emerging. Second, the 
adequacy of present assessment practices is being queried. Taken together, 
the implication inherent in these challenges to the current psychoeducational 
model is that several aspects of the present system of special education must 
be rethought. 


Challenging the Categorically-Based Psychoeducational 

Service Delivery System 

At the present time a challenge is emerging to the special education service 
delivery system (Hallahan, Keller, McKinney, Lloyd, & Bryan, 1988; Kauf- 
fman, Gerber, & Semmel, 1988; Keogh, 1988; Singer & Butler, 1987; Stain- 
back & Stainback, 1984). In its most extreme form, proponents of what has 
been termed the “regular education initiative” are calling for dramatic chan- 
ges such as the appointment of school-based personnel to facilitate the 
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integration of all handicapped individuals, the expansion of regular cur- 
riculum (including such topics as community living skills, sign language, and 
braille), and the retraining of special education teachers to focus on instruc- 
tional categories rather than on student categories (that is, a noncategorical 
model). Once the extreme positions are tempered, however, there are two 
main objections to the psychoeducational model: (a) the categories presently 
used to classify students with special needs are neither valid nor reliable, and 
(b) the assessment process is costly, time consuming, and lacking in educa- 
tional relevance. 

Further, the relationship between classroom assessment and instruction- 
al procedures and those completed in the resource room have become remote. 
Once the referral has been made and the student accepted for special ser- 
vices, the classroom teacher and the special educator to a large degree work 
independently of each other. In part due to this independence, a significant 
discrepancy has emerged in the tools used to assess youngsters referred for 
special education and those assessment strategies used to determine their 
classroom performance. Although classroom observation has been utilized, 
classroom performance has not been used extensively in conducting 
psychoeducational assessment. As a result, placement decisions were not 
based on what students were capable of doing in the classroom from which 
they were referred. In addition, the learning resource teacher or other special 
education teachers have assumed a large portion of the responsibility for 
teaching the student with special needs, and the regular classroom teacher 
has turned his or her attention to the remainder of the class, completely 
distancing any student with special needs from his or her classmates. Thus 
the special educator has learned to rely on assessment and teaching 
strategies that are divorced from youngsters’ daily classroom performance 
and expectation. 


Challenging the Conception of Students 


Measuring what students with special needs have learned is confounded by 
the ineffective learning strategies they typically display. Examples are sear- 
ching for easy solutions, failing to persist at work, being poorly organized, 
setting unrealistic goals, and attacking work in a rigid manner. Having a 
cluster of faulty learning habits, however, is not the only significant marker. 
Equally important is the social value attached to these faulty learning habits; 
that is, an answer to the question, “Is the absence of specific learning 
procedures or of particular knowledge seen as a significant weakness?” must 
be obtained. If the risk associated with lacking some ability is minimal, there 
will be few negative consequences assigned to that incompetence. For ex- 
ample, the student who cannot sing may be embarrassed in music class, but 
the probability of being identified as having learning problems because he or 
she is a poor singer approaches zero. On the other hand, if the ability in 
question is reading, the chance of being so identified is high. The final aspect 
to be considered is hinted at in the last example; that is, how frequently will 
any ability, skill, or knowledge be required? 
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Placed in perspective, it seems logical to think of all youngsters as falling 
along a continuum of competence, rather than as being divided into a set of 
discrete categories. There will be some school specific tasks at which students 
will be successful and some at which they will need extra assistance in order 
to learn more effectively. The goal of assessors completing either classroom 
or clinical assessments, then, is to obtain a description of where students fall 
along this continuum. As discussed above there are three facets to be con- 
sidered: (a) the degree to which students’ learning strategies are faulty or to 
which they lack necessary knowledge, (b) the importance or social value 
attached to having either the appropriate learning patterns or information, 
and (c) the frequency with which such abilities, skills, or knowledge are 
required in the classroom. The greater the magnitude of and the more the 
interplay between these facets, the greater the perceived weakness and the 
stronger the negative valence attached to the description of the student: In 
short, the performance of youngsters must be viewed in a relative rather 
than in an absolute manner. 


Challenging the Assessment Model 


As they have begun to assume a more comprehensive assessment and in- 
structional role within the neighborhood school, learning resource teachers 
slowly have started to question the adequacy of current assessment techni- 
ques. Two problems have been identified. 

The first problem is that the process of making placement decisions has 
become cumbersome (Ysseldyke, 1983; Ysseldyke et al., 1983). To review, a 
series of tests is administered by both the school psychologist and the re- 
source teacher. Other professional personnel are consulted on a case-by-case 
basis. After this process is completed, a school-based team (consisting of the 
school psychologist, counselor, principal, learning resource teacher, and re- 
ferring teacher) meets to determine the disposition of the case. In evaluating 
this system of placing students, Ysseldyke (1983) reported a number of 
startling findings: (a) the topics of such meetings were dominated by reports 
given by school psychologists and the major focus was on achieved test scores; 
(b) the greater the number of tests reported, the more likely they were 
technically inadequate; (c) the issue of placement decisions was allocated the 
second largest percentage of meeting time and educational planning received 
the least attention; and (d) teachers and parents rarely were asked to offer 
information or to propose educational options. 

A second common complaint has been with the usefulness of some of the 
commercially available tests, even though teachers have reported stand- 
ardized measures to be the most useful in instructional planning (Thurlow & 
Ysseldyke, 1982). There are at least four major reasons for the restricted 
applicability of much of the currently available assessment data: 


1. The technical adequacy of many measures is questionable. 
2. The emphasis has been on reporting test scores rather than on descrip- 


tions of what students typically do. 
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3. In many tests there is an insufficient number of items per ability or skill 
being measured to allow a reasonable estimate of any set of students cur- 
rent level. 

4. The conditions under which these data have been collected do not repre- 
sent classroom performance. 


Proposed Solutions Within the Current Model 


Some solutions have been proposed within the application of the psycho- 
educational model in the attempt to better characterize both what and how 
students with special needs have learned. The first approach is to reexamine 
data collection technology and tests. The second strategy is to change testing 
procedures. 


Expert Systems and Broadening the Test Base 


The first of these proposed solutions has been to generate expert assessment 
batteries. The rationale has been to identify measures that would yield as 
close to perfect error-free diagnostic-prescriptive data as possible. In the 
attempt to devise such an expert system, computer-based automated assess- 
ment systems (e.g., Colburn & McLeod, 1983; Hofmeister & Lubke, 1986) 
and other specialized assessment batteries (e.g., Kaufman & Kaufman, 1983) 
have emerged. The special educator’s role in assessment is again minimized 
with this proposed solution to assessment administration. Scoring and inter- 
pretation follow from using the expert system and often is computer generat- 
ed. 

An alternative approach has been to prepare behavior-specific or subject- 
based assessment instruments. A plethora of such measures, consisting of 
checklists, standardized tests, and nonstandardized published or unpublish- 
ed informal inventories, have been developed (for a discussion see Salvia & 
Ysseldyke, 1988). The task of learning assistance (resource) teachers or other 
school-based special education teachers is much more complicated if they opt 
for this solution. They must select appropriate instruments that when ad- 
ministered and scored will provide useful instructional planning information. 
Wilson (1985) has argued that the process can be simplified considerably by 
using cut-off scores derived from administering group achievement tests as 
the base for placement decisions. He found that when using the 10th percen- 
tile as a cut-off score, approximately 70% of students with special needs were 
classified correctly. The use of such a simplified technique, however, would 
still mean nearly 30% were misclassified. 


Changing the Data Collection Procedure 


As mentioned earlier the most frequently employed assessment technique is 
the administration of standardized tests. Data collection procedures are 
being reexamined within the field of special education with the intention of 
obtaining a better representation of what and how students with special 
needs learn. 

One alternative assessment strategy has been to utilize criterion-refer- 
enced measurement to sample student in-class performance. Two criticisms 
have been stated. First, a criterion-referenced approach to measurement has 
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the implicit assumption that the subskills of any curriculum can be stated 
and items defined to measure such subskills. Second, to fully represent any 
curriculum an inordinate number of items is required; however, in contradic- 
tion, a parallel criticism is that most criterion-referenced tests contain too 
few items per objective to provide useful diagnostic information (Bachor, 
1985). One answer has been to refine the procedures used in criterion-refer- 
enced measurement. In its new form in special education, criterion-refer- 
enced measurement has been termed curriculum based assessment or CBA 
(Deno, 1985; Fuchs, Fuchs, Benowitz, & Barringer, 1987; Marston, Mirkin, 
& Deno, 1984). CBAs are measurement procedures that are designed to 
match students’ program objectives. 

There are two main versions of CBA. In the version suggested by Fuchs 
and her colleagues (Fuchs, Deno, & Mirkin, 1984; Fuchs & Fuchs, 1986; 
Fuchs et al., 1987), a pool of domain-referenced items is constructed to match 
annual goals and instructional objectives. Items, called probes, are then 
sampled from the item pool and administered to students to monitor instruc- 
tional effectiveness. In the version suggested by Deno and his associates 
(Deno, 1985; Deno, Marston, & Mirkin, 1982; Wesson, Fuchs, Tindal, Mirkin, 
& Deno, 1986), quick probes to estimate performance, for example, the 
number of words read correctly in one minute from a basal reader, the 
number of correctly spelled words from dictated lists, and the number of 
words written on a story starter, are obtained. 

Wesson et al. (1986) summarized the use of CBA by stating that using it 
allows a teacher to develop short, simple measures based on curriculum. 
These measures are administered from three to five times a week and 
students’ performance data are graphed and analyzed. This information then 
is used to determine the appropriateness of the students’ individualized 
educational programs. Fuchs and Fuchs (1986) argue that the use of CBA in 
developing and evaluating instructional programs over the long term can be 
expected to have a significant positive effect on the achievement of students 
with special needs. They add that for the use of CBA to be effective, instruc- 
tional goals for individual students must be stated and monitored. In a 
meta-analysis in which the goal was to compare long-term and short-term 
assessment, they noted an effect size interaction. “When progress was mea- 
sured toward long-term goals, effects sizes calculated on global outcome 
measures were higher than on probe-like outcomes. On the other hand, when 
progress was measured toward a series of short-term goals, effect sizes were 
lower on global than on probe-like outcome measures” (Fuchs & Fuchs, 1986, 


p. 78). 


Summary 


While such revisions may be helpful to anyone concerned with diagnostic 
procedures, they must be put into perspective. All collected data must be seen 
to contain error in two senses: 


1. There is the error of measurement associated with the administration of 
any particular tool in the sense that no single procedure, regardless of 
the attractiveness of the assessment technique, can stand alone. 
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2. There is the error of the measurement that accompanies taking time- 
bound samples, in that in taking any individual measurement only what 
a person did at that time can be estimated. In the decision making 
model to be addressed next, these two problems areas are addressed and 
techniques to expand data collection are suggested. 


Modifying the Model: Incorporating Decision Making 
and Classroom Data 


A second assessment model—coined the decision making model (Bachor & 
Crealock, 1986), a decision making procedure (Ysseldyke & Regan, 1980), or 
an instructional-based model (Humphries & Wilson, 1986)—has been sug- 
gested in which the traditional model of information collection has been 
modified and extended. The primary emphasis in implementing the decision 
making model is to obtain instructionally relevant information about both 
students and teaching techniques. Because this model is an extension of the 
traditional one, several hybrids of each are currently being practiced by 
special educators (e.g., Ministry of Education, 1989). 


Some Suggested Assessment Strategies 


Assessment when using this model changes to a multifaceted process design- 
ed to reflect a continuum of strengths and weaknesses. The argument is that 
no one assessment technique will suffice. Stated more positively, the applica- 
tion of all assessment techniques, regardless of validity, can never result in 
more than a situation-specific estimate of what a youngster can do. Thus, to 
obtain a reasonable estimate of performance for any youngster, a number of 
assessment techniques (interviewing, observing, testing) must be employed 
that yield information about performance under a variety of conditions (such 
as when receiving instruction, when working independently, when doing 
something enjoyed, etc.). By following such procedures, information will be 
obtained that will allow an estimate of the degree of difficulty experienced by 
any individual in doing something. Or the use of such procedures will allow a 
special educator to approximate where any student with special needs falls 
along the learning/achievement continuum. In addition, it is recognized that 
a portion of the collected data must be drawn from the youngsters’ home 
classroom. The procedures of cumulative record analyses, comparative and 
continuous observation, and work sample analysis are suggested for this 
purpose. These procedures are accompanied by conducting interviews and 
examining the effectiveness of instructional procedures. 

Starting with the students’ history. In applying the decision making 
model, Bachor and Crealock (1986) argue that the special educator should 
start by examining the track record of students with special needs. They 
recommend classifying all available information as being one of four types: 
historical opinions, current opinions, historical measurements, current 
measurements. In a recent revision, Bachor (1989) suggested that teachers 
should further classify the information in the cumulative files into crude 
commonalities by describing the conditions under which the data were col- 
lected. In completing this procedure they are to describe what the assessor 
did (interviewing, observing, testing, teaching) while presenting any assess- 
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ment task and what the youngster was required to do in response (listen, 
write, read, copy, talk, make). In addition, any available information on how 
the student has been taught and what subject-specific materials have been 
used should be noted, especially if there are data to note the utility of such 
methods and materials. Next, the data are to be grouped (listing the type and 
sources of the collected data) around any probes or tentative hypotheses that 
the teacher wishes to advance as the starting point of further assessment. (A 
probe is a data point that only occurs once but one that the decision maker 
suspects should be checked further. A hypothesis differs from a probe in that 
it must be supported by two or more data points from different sources, 
preferably those that cover a span of time rather than those that were 
conducted on the same day.) 

Sampling classroom performance by observation. As indicated in the 
introduction to this article, when students with special needs are given a 
classroom test they face a different set of conditions than do students who 
are normally achieving. Many of the currently applied classroom and clinical 
assessment procedures are designed so that the obtained data do not reflect 
the knowledge-base of students with special needs. Comparative observation 
is a technique in which teachers examine students’ classroom performance in 
any subject matter. They do so by comparing the performance of the student 
or students in question to a stratified sample of other students in the class. 
To be most useful this sample should be subject matter specific and should 
consist of two to four students who fall into three groupings: those perform- 
ing near the top of the class, those performing in the average range, and 
those having difficulty. Evaluation standards then are based on the reason 
for referral; they can range from amount of work completed in a given period 
to the quality of the completed assignments. 

This comparative evaluation may be accompanied by a continuous chro- 
nology. The purpose of obtaining such a chronology is to provide a descriptive 
summary of the student’s activities in his or her classroom environment. The 
subject matter observed is not critical in this case; however, the conducted 
observation should include as wide a variety of activities as is possible across 
the school day, while avoiding atypical events such as testing periods, in order 
to obtain a complete representation of any student’s general pattern of 
behavior. 

Work sample analysis. Bachor (1979) has described a procedure whereby 
students’ daily classroom performance is used as the basis of assessment. 
This procedure has been revised to allow the regular classroom teacher or the 
special education teacher to obtain a technically adequate sample of class- 
room performance that can be translated directly into instructional objec- 
tives (Bachor, 1984, 1989). Specifically, the following four steps are 
suggested to complete work sample analysis when the purpose is the descrip- 
tion of a single student or a small group of students: 


1. Select three to five assignments completed over a period of two to six 
weeks. The skills or abilities to be observed must be repeated in some 
form in each selected assignment. For each assignment, list strengths 
(abilities or skills at or above grade or age level expectations), skills 
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present (abilities or skills mastered by individual but not meeting grade 
or age level standards), and weaknesses (errors of omission or commis- 
sion). 

2. Look for commonalities in error patterns across all work sample 
analyses. 

3. Compare observations of strengths, skills present, and weaknesses to ex- 
pectations set for students (use curriculum guides or other locally estab- 
lished standards). 

4. Using both commonalities and expectations, prepare diagnostic test 
items to establish approximate boundaries of performance. 


Procedural interviews. Estimates of how students are learning may be 
obtained by interviewing them (Meyers, Pfeffer, & Erlbaum, 1985). Inter- 
views can be conducted in three separate phases. First, students can be asked 
to explain how they plan to do any task. Second, they can be interviewed 
while completing a preselected standardized assignment similar to class- 
room-based activities. Third, they can be questioned for post hoc explana- 
tions of how something was done. Again comparative interviews can be 
conducted to determine, in a normative yet descriptive manner, where one 
student stands in relation to another. 

There are difficulties associated with interviewing, however. If one ad- 
ministers a series of subject-specific problems with the intention of estab- 
lishing procedural knowledge, the problem is that to some lesser or greater 
extent such performance must be inferred. A wide discrepancy in subjects’ 
reading levels, for example, will be observed as variability in text processing 
and likely as restricted word knowledge (Kintsch, 1988). Thus, at a mini- 
mum, steps should be taken to observe subjects’ reading levels and to de- 
scribe their reading strategies across a variety of texts. Similarly, students’ 
abilities to remember material may influence their strategy knowledge and 
utilization, as will their confidence in approaching school-related subjects. 
For example, Steacy & Bachor (1987, 1988) found that functional short-term 
memory span and academic self-concept affected students’ acquisition and 
performance on arithmetic word problems. Thus it appears necessary to 
interview youngsters on strategies across a variety of subject matters to note 
variations in their strategic repertoire and their affective responses to how 
well they are doing. 

Effectiveness of instruction. A final assessment procedure within this 
model is to conduct trial lessons. The purpose here is to evaluate whether any 
suggested instructional method or material is likely to work well with a group 
of students. The criterion of success is whether the student can demonstrate 
mastery of the taught curriculum at the end of the instructional period. 
Given that these procedures are intended for students with special needs, a 
further precaution is taken; that is, can such students remember the instruc- 
tion, and did they remember it over time? 


Summary 


The key difference in implementing a decision making model, as opposed to 
the psychoeducational model, is that the emphasis is on description of perfor- 
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mance. To translate diagnostic procedures into instruction, assessment pro- 
cedures need to be drawn from a variety of sources and should reflect how 
students act under different conditions. Making such modifications in the 
assessment process should enable all professionals to better relate assess- 
ment to required instruction. 


Conclusion 


The position adopted in this article is that yet another directional shift in the 
assessment and instructional practices for students with special needs must 
be taken from a psychoeducational model of assessment to an ecological one. 
This modification in procedure is fairly straightforward. Instead of relying on 
single measurements or time-bound measurements, measurement portfolios 
that represent any youngster’s classroom ecology are required. In building 
such portfolios, the emphasis will need to be on information utility. The 
question of utility can be stated as follows: How do the data being collected 
help to make more effective program and placement decisions? To answer 
this question, it was suggested that descriptive data that span time and 
contain both objective and subjective information are to be collected. Further, 
it was recommended that such descriptive data should be gathered on a 
stratified sample of students to permit portfolio comparison. By collecting 
these assessment portfolios and using the obtained information to monitor 
instructional methods and materials, more effective instructional decisions 
can be made for students with special needs. 
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Time to Replace the Classroom Test 
with Authentic Measurement 


An argument is made that test scores are painfully feeble and fall far short of adequacy as 
measures of educational achievement. We have learned a great deal in the past decade about 
the importance of meaning and context in language teaching, and these ideas are beginning 
to inform our understanding about the learning of mathematics and science. Achievement 
takes on new meaning—not as some general or abstract trait to be captured by a single 
number but instead as performance on meaningful tasks in a recognizable context. The most 
promising tool for the assessment of authentic achievement in the classroom (and perhaps 
for accountability as well) is the portfolio, a systematic, cumulative record of performance. 
Evaluation of portfolios, or “records of achievement” as they are known in the United 
Kingdom, has shown them to be promising ways to truly integrate student evaluation with 
the rest of teaching and learning. 


It has been fashionable to define test as any systematic measurement proce- 
dure, including observations, checklists, and other tools in the category, but 
that broad definition has never caught on. To the vast majority of people, a 
test is a fixed set of questions written on paper to which one responds and 
from which the test administrator derives a test score. Say “classroom test- 
ing” to anyone and they will imagine students “writing” a test. Other evalua- 
tion tools are used in classrooms, of course, but they are not referred to as 
tests. Measures of listening comprehension, for example, have spoken stimu- 
li, but the responses are written and the outcome is a test score. Evaluation 
of spoken language is done through an oral interview—not a test. In this 
article, the word test will be used in the specific sense of written stimuli, 
written responses, and a single number (or a few numbers) as the outcome. 

Bloom’s Taxonomy (Bloom, 1956) struck a responsive chord, but we have 
not seen the expected flowering of items testing the higher levels—synthesis, 
evaluation, and the like. In fact, every study ever done of actual tests has 
found a heavy, often exclusive, reliance on questions that ask only for know- 
ledge. We have built a structure of so-called test theory that was attractive on 
the outside but has remained almost empty as far as achievement tests are 
concerned. It is the thesis of this article that the achievement test structure 
is mostly empty because it is uninhabitable. We have great difficulty seeing 
that test scores are a cop-out, an admission of ignorance we can no longer 
afford; as measures of educational achievement, they are painfully feeble and 
fall far short of adequacy. The persistent urging of tests on classroom teach- 
ers and the neglect by measurement specialists of any other tools continues 
to be a great disservice to education. 


Leslie McLean is a professor and head of the Educational Evaluation Centre. His research 
interests center on student evaluation. 
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The pity is that the structure we call test theory and the instruments 
derived within it threaten to drive out any competitors, despite their 
vacuousness when it comes to the measurement of achievement in the class- 
room. They are vacuous because they are empty of the content that marks 
authentic achievement. A soul-searching article on measurement (i.e., test) 
driven instruction (Airasian, 1988) is illustrative, in that it raises many 
questions about content but provides no answers. While we are waiting, there 
are content-rich alternatives to tests—more positive and attractive tools for 
the evaluation of student achievement. The rest of this article is devoted to a 
description and discussion of an alternative class of tools and to an argument 
as to why they are more likely to be authentic and why they deserve to replace 
testing in the classroom. 


Authenticity 


The concept of validity has long been prominent in psychological testing, 
where nomological nets can be woven. For example, the Myers Briggs Type 
Indicator (MBTI) has a strong claim to validity within a Jungian theoretical 
framework. There has been a call for some years for a theory within which 
interest measures can be grounded; the early enthusiasm for empirical scales 
such as the Strong or Strong-Campbell is giving way to skepticism. In 
achievement tests, however, validity has proved an elusive quality. Credible 
achievement constructs (Haertel, 1985) have not been found. Face validity or 
curricular validity (essentially an overlap with the surface curriculum) has 
had to suffice, but given the complexity of curricula, the tests invariably had 
low validity. Large-scale surveys based on item and person sampling 
achieved much more complete overlap, but they tended to fall of their own 
weight back on multichoice items at the knowledge level. Before the miracle 
of item response models, such surveys did not yield scores for individuals and 
hence were limited to monitoring. They therefore made little or no contribu- 
tion to measurement in the classroom beyond providing extremely rough 
guidelines and items that could improve only the poorest existing practice. 
We now have the Duplex Design claiming to provide individual scores (Bock 
& Mislevy, 1988), but not everyone is willing to accept the claim (Goldstein & 
McLean, 1987). Even if accepted, few jurisdictions will be willing or able to 
afford such large surveys on a regular basis, so impact on classroom measure- 
ment will be minimal. 

Impact on classroom practice can be substantial in that high-stakes tests 
lead to more practice with test items in order to achieve better test scores 
(Corbett & Wilson, 1987). Improved test scores, however, do not lead to 
improved performance on meaningful tasks, as language teachers found a 
decade ago. Reduction of curriculum content to test content, common in 
science and mathematics classes, cannot be described as improvement. Valid- 
ity is now being addressed by what we might call NAEP Norms, citing the 
items answered correctly by high proportions of the survey sample at various 
levels of the test score. This is an improvement on previous practice in that 
the interpretation is based on performance on items we can actually ex- 
amine. When we examine them, however, we are thrown back on prior 
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questions, “Why are these the right items?” and “Why should we select items 
to examine based on this particular test score?” We need, in short, a way to 
authenticate our measures. Authenticity may be a more helpful and useful 
word than validity to describe the quality we seek in achievement measures. 

An articulate call has recently gone out for authentic measures, endorsed 
and distributed by the United States National Association of Secondary 
School Principals (Archbald & Newmann, 1988). The concept of authenticity 
is based where it should be, on the theories and models of curriculum content 
and cognitive processes (where we have them), rather than on abstract 
models of test scores masquerading as theory. There are, for example, com- 
municative or functional language theories that guide curriculum design, 
including the definition and measurement of achievement. There do not 
appear to be comparable theories in other subjects, mathematics and science 
for example, and it may be that there are no others. Perhaps there are only 
different versions of language theory, versions that apply to the different 
symbol systems used in mathematics and science—and music and art. Cogni- 
tive scientists are hard at work on theories of microprocesses and macro- 
strategies that might reach theory status. 

An achievement measure is authentic if it faithfully reflects: 


1. The knowledge and the processes that form the basis of the subject mat- 
ter. (Cognitive scientists refer to executive processes, language theorists 
refer to meaning making and discourse competence. The concept of pro- 
cess is not exactly a firm one yet, but it appears over and over again in 
various guises.) 

2. The conditions under which the achievement normally takes place. 


Some years ago, a seminal study of teacher constructs of the curriculum 
and of students went beyond the surface curriculum (Bussis, Chittenden, & 


Amarel, 1976), and now our achievement measures are starting to follow 
(Swain, 1988; McLean, 1988). 


Subject Matter Processes and Knowledge 


The only subject matter processes and knowledge we understand at all well 
are in the subject of language (Canale & Swain, 1980; Gorman, 1986; 
Thornton, 1986). Three points are salient for this paper. 

The ability to communicate (whether listening or reading, speaking or 
writing) depends crucially on what one already knows—on the knowledge, 
concepts, and beliefs acquired to date. Meaning does not exist in some pure 
form outside us; it is constructed in the process of communication from what 
we already know or believe. Knowledge of the world is therefore crucial to 
communication. 

Discrete point knowledge (isolated words, rules of grammar) tells us 
nothing useful about what a person can do with it. 

Achievement worth knowing about involves subtle changes that happen 
over months. Changes are continuous and cumulative, the result of multiple 
interactions of which schooling is but one. A necessary condition for change 
is the construction of meaning. 
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Conditions Under Which Achievement Takes Place 


The conditions of learning and the conditions for the demonstration of 
achievement have not featured in the design of achievement measures to 
date. Once designed, published tests come with instructions for standard 
administration, but this is a surface feature different from the point being 
made here. Language theories emphasize the importance of context and 
purpose in the shaping of communication, so both pedagogy and measure- 
ment should take these into account. Contexts meaningful to the students 
are required for teaching, hence for achievement measurement. Communica- 
tive language achievement measures present texts of a hundred words or 
more set in a context with which the examinees can identify. It is therefore 
impossible to produce measuring instruments that can be used in the same 
way across a country, or even a province. The instruments (substantial 
booklets are required) can be used, but some parts of them are not appropri- 
ate and universal scoring schemes are not valid. They have to be regarded as 
sample collections of stimuli to be adapted to local conditions. (Examples are 
the reading assessment booklets used by the Assessment of Performance 
Unit in England and Wales and the procedure, A vous la parole, devised by 
the OISE Modern Language Centre for use in French immersion classes. The 
APU instruments were applied across England and Wales, but the reports 
were of disaggregated results and the most valued outcomes have been the 
booklets themselves which have been widely distributed to teachers.) 

Students are increasingly being encouraged to work cooperatively in 
small groups. This is being promoted in language classes, and the umbrella 
administration for Toronto schools recently published a book encouraging 
cooperative small group learning across the curriculum (Metropolitan Toron- 
to School Board, 1988). The OISE Cooperative Learning Centre is busy with 
professional development workshops, and there is an International Associa- 
tion for the Study of Cooperation in Education. If much of the classroom 
learning has taken place in groups, any measurement of achievement that 
relied exclusively on individual measurement would lack authenticity. It may 
be acceptable to require individual products of learning, but students should 
have the opportunity to develop some of them in cooperation with others. If 
one of the goals of the curriculum is for students to learn to work well in 
groups, then some assessment of group processes would appear to be re- 
quired. 


Summary of Implications 


Authentic measurement must involve varied settings and purposes. It must 
include assessments collected over several months, the longer the better. 
Tasks meaningful to the students are required. All these conditions predict 
that performance will vary with tasks in complex ways. Student-task-setting 
interactions will mean that authentic measures have very low indices of 
internal consistency (Swain, 1988). Authenticity will be inversely related to 
traditional estimates of measurement quality, because the traditional mea- 
sures are derived from testing constructs that do not accord with current 
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subject theories. This is not surprising, because test theory was assumed to be 
independent of subject theories. 


The Forms of Authentic Measurement in Classrooms 


Teachers and students in classrooms are uniquely situated to respond to the 
implications of subject theories, at least as we see them in language. The 
implications for achievement definition and measurement are profound. 
What is required is a systematic, cumulative record of performance on mean- 
ingful tasks. The writing folder can be one example of such a record, if the 
teacher sets the tasks systematically and well, in cooperation with the stu- 
dents. Standards are set by the requirements for the contents of the folder 
over the course of a term, or better over a school year. Evaluation of in- 
dividual entries is done jointly by teacher and student, sometimes bringing in 
other students. Students often work cooperatively, but each student retains 
a collection of the work he or she produces. Not all work is marked, but the 
teachers specify in advance a set of assignments that are expected to be in the 
folder for marking. Students choose some of their best work to add to the 
record, negotiating a mark with the teacher. In case of dispute, both marks 
can be assigned (the student’s and the teacher’s) with a rationale. Teachers 
who have tried this report that the negotiations have been some of the most 
valuable learning experiences of the term (Radnor, 1988). 

The generalized form of the systematic, cumulative record is sometimes 
called a portfolio, or just a record of achievement. Records of achievement 
have been the object of study in 10 large pilot schemes in England and Wales 
over the past two years, and the evaluation of those schemes has just been 
published (Broadfoot, James, McMeeking, Nuttall, & Stierer, 1988). Many of 
the schemes experimented with records of achievement in and out of school 
and in nonacademic areas, raising a number of ethical questions. Some 
observers are concerned, for example, that records of achievement that 
include activities outside of school (clubs, part-time jobs) will be used by the 
authorities for surveillance (Hargreaves, in press; E+M Newsletter, 1988- 
89). 


Arriving at School Marks 


Portfolios lend themselves best to category marks such as A, B, C, the 
difference from usual practice being that descriptions of the categories can be 
written in terms of the performance expected in the portfolio. Portfolio 
entries are marked individually and the marks aggregated to arrive at the 
term or final mark. The advantage of such a scheme is that the weighting of 
marks is explicit and visible to students and parents. Teachers in secondary 
schools often weight term tests, quizzes, and assignments, of course, but the 
weighting of content in the tests is rarely known accurately. Percentage 
marks are used, with a spurious appearance of accuracy and meaning. Once 
the complexity (especially variability) of achievement is better known 
through the subject theories, it is easier to accept that category marks (at 
most five) represent the most accurate assessment available to the teacher— 
hence a more honest procedure than the ones usually seen. 
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Reporting to Parents 


Parents receive the usual report cards, with category marks, accompanied 
by definitions of the categories. Portfolios are collected by the teacher at the 
time of parent interviews so that conversations can be conducted in terms of 
the demonstrated performance of the student. (Parents may have seen the 
portfolio all along, of course, because it is in the student’s possession most of 
the time.) There is bound to be some normative interpretation when the 
teacher justifies the standards set for the various marks, for example, “These 
assignments were completed successfully by all but three students in the 
class, but the extensions proved much more difficult. Your daughter did all 
the difficult ones.” The essential point is that achievement is not expressed in 
abstract terms interpretable only normatively or by the teacher (if then). The 
parent can see what was required, what was submitted, and what evaluation 
the student gave the work. 


Accountability in General 


The tasks on which students work in cooperative small groups perhaps 
contain the skills, knowledge, synthesis, evaluation, and the like that define 
the learning framework. The term, “learning framework” is used because the 
boundaries are not fixed in advance. As with curriculum guidelines, mini- 
mum but not maximum targets are set. Unlike guidelines, the targets are 
expressed in terms of meaningful tasks that contain the skills and knowledge 
that now make up most guidelines. Accountability to the principal, board, 
and Ministry may be expressed in terms of targets set and targets attained, 
supplemented with reports of excellent work done over and above minimum 
expectations. The tasks should be listed, and the performance required, and 
then summaries of attainments provided that will be at least as informative 
as the current average percentages and test scores. Convincing all stake- 
holders of this is likely to take considerable time and effort. There are those 
who cannot live without tests and test scores. Unless schools everywhere 
begin to use the same tasks and report on them in the same way, there will 
still be demands to know where the schools stand in relation to one another. 
The most defensible educational solution will be for provinces or boards to 
survey achievement with a standard set of tasks, using light sampling to take 
a minimum of instructional time. If standard tests are the only means some 
in authority will accept, however, then they will have to be incorporated into 
the classroom work to give the students adequate practice. We will know we 
are making progress in education when there is widespread criticism of such 
practices, especially from those specializing in measurement, pointing out 
that they are educationally unsound and disruptive of teaching and learning. 
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Assessment in the Classroom: 
What is the Role of Research? 


The foregoing papers were reviewed from a perspective based on five propositions about 
classroom assessment. These refer to the nature of achievement, the information about 
students needed to interpret assessments of achievement, the tension that exists between the 
need to describe and the need to appraise, the distinction between formal and informal 
assessments, and the need for fairness. 


This discussion of the papers commissioned for the Victoria Conference is 
divided into two main sections: The first consists of a perspective on class- 
room assessment in which several propositions about the nature of the 
enterprise are advanced. (I have chosen to substitute the phrase classroom 
assessment for classroom testing. Testing suggests particular kinds of ac- 
tivities, whereas assessment is less specific in connotation and is often used 
to encompass any reasonable means imaginable for collecting information 
about student achievement.) The second section contains comments on the 
papers and suggestions for future research. 


Perspective 


The perspective derives from five propositions: 
Proposition I: Scholastic achievement is inferred from student behavior and 
from products of that behavior. 

The evidence we obtain of a student’s achievement derives from his or her 
performance of particular tasks on particular occasions. This evidence con- 
sists of an evaluation either of the performance of the task at the time it 
occurs (e.g., an adjudicator’s rating of the performance of a composition on a 
musical instrument or a teacher’s appraisal of the conduct of a laboratory 
experiment) or an evaluation of a product resulting from the task perfor- 
mance, this evaluation often being made some time after the performance 
(e.g., the teacher’s appraisal of a student’s answers to the questions in a 
written test). The evidence collected is, for most kinds of achievement, only a 
sample of the myriad kinds of evidence it would be possible to collect. So, 
although we refer to the performance or the products that result from the 
performance of particular tasks, our interest usually extends beyond par- 
ticular performances of particular tasks to the performance of tasks similar 
to those that were attempted and perhaps also to the performance of tasks 
that are in some respects unlike those that were performed. From this 
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perspective, the high achieving student is one believed capable of behaving in 
the future with reference to familiar tasks, and possibly also with reference 
to unfamiliar tasks, in a way that will be judged acceptable or appropriate or 
noteworthy. An associated issue, then, is how widely the expectation of 
achievement for a student can be generalized: What limits should be imposed 
on the inference made from the sample of behavior and behavior products 
that served as the basis for the inference of achievement? 

Proposition II: Achievement cannot be understood or interpreted without 
reference to the conation and affect of the actor, as well as her or his 
cognition. 

The point of this proposition is that human behavior is a complex phenom- 
enon, so making correct interpretations of the available evidence regarding a 
student’s achievement is a hazardous exercise. According to the Oxford 
Shorter English Dictionary, cognition is “the action or faculty of knowing; 
knowledge, consciousness; a product of such an action.” Conation is “en- 
deavor; the faculty of volition and desire; also the product of this faculty.” 
And affect is “mental disposition; desire, passion.” Clearly, achievement is 
more than knowledge (cognition), for the behavior on which the assessment 
of achievement rests involves conative and affective components as well. In 
addition to knowing how to behave in ways that are appropriate and that 
serve to inform the observer, either through the behavior or a residue of that 
behavior, a student must also want to behave in a way that shows he or she 
possesses a particular kind of knowledge or ability. This may seem an obvious 
point, but we must be ever mindful of the fact that students who do not 
willingly perform tasks of the sort that provide evidence of knowledge and 
skill may be labeled disabled or underachievers for the wrong reason; namely, 
for the reason that they lack of knowledge or skill or ability, when instead 
they lack will or interest. 

Proposition III: There is a tension in classroom assessment between describ- 
ing and appraising student achievement. 

Description may be characterized as the act of preparing a list or narra- 
tive summary of a student’s behavior, including, perhaps, a collection of 
associated products, all of which, in the view of the person preparing the 
description, reflect achievement and support whatever inference is made of 
achievement. Appraisal goes at least one step beyond description and invol- 
ves making judgments of good/bad, acceptable/unacceptable, strong/weak, 
high/low. This is necessarily done with reference to a set of norms or bench- 
marks, whether explicit or implicit. Assessments of achievement serve dif- 
ferent purposes, but I cannot think of one that is wholly satisfied by complete 
reliance on description. The diagnostic purpose may come closest to being 
satisfied by description. The foremost consideration in diagnosis would seem 
to be description that reveals to the student and his or her mentor/ 
coach/teacher that which is inadequate about a behavior or product, and that 
which needs to change in the student’s cognition/conation/affect to alter the 
task performance in appropriate ways. Even here, however, there is implied 
contrast, between an actual behavior/product and a satisfactory or desired 
behavior/product. Thus good description is insufficient even for diagnosis. As 
Robert Wilson noted in the paper he prepared for this conference, description 
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is the main function of student assessment in the earliest grades of school. 
But even then, and certainly in later grades, it is expected that the teacher 
will judge whether a student’s behavior and the products generated by that 
behavior satisfy expectations for the student’s grade level. These expecta- 
tions might be explicitly defined in a norms table, which is a summary of the 
way students representative of some well-defined population performed a 
standard set of tasks. More frequently, however, the student’s behavior and 
the products of that behavior will be referred to expectations that the teacher 
has developed and internalized through experience, both with students and 
with subject matter or curriculum. Even the so-called criterion-referenced 
test involves a normative component whenever a student’s performance on 
such a device is used to support a judgment of adequacy of achievement. 
Proposition IV: In the act of appraising student achievement, a distinction 
can be drawn between informal and formal evaluations. 

Informal evaluations of student achievement, as this notion is used here, 

are based on evidence obtained using tasks that are not standardized and for 
which norms are usually not available. Formal evaluations of student 
achievement are based on evidence obtained using standardized tasks and, 
typically, with reference to norms for judging the relative quality of the task 
performance. (Norms reflect variation in the performance of a standardized 
task by persons representing a clearly defined population.) A standardized 
task, analogous to a standardized test, is one designed for use by different 
persons in different places. It is intended that every use of a standardized 
task be governed by prescriptive rules for administration and scoring, these 
rules having been developed so that, when they are followed, the measure- 
ments or scores achieved by different persons in different places can be 
compared directly. Many, perhaps most, of the tasks used by teachers to 
assess student achievement are not standardized, hence they yield informal 
evaluations of achievement. Examples of formal evaluations include those 
based on student performance on province- or state-wide examinations in a 
subject, the norms being derived from the distribution of scores for all the 
students administered the examination. 
Proposition V: Evaluations of student achievement that are used for selec- 
tion or for determinations of pass/fail seem to demand, for the sake of 
fairness and equity, some use of standardized procedures and reference to 
norms. 

The issue addressed in this proposition is two-sided. On the one hand is 
the matter of comparability of evidence of achievement from student to 
student within a class, and from class to class within a school, board, prov- 
ince, or country. A belief many in society hold is that an order of merit should 
be established among schoolchildren, an order that reflects their relative 
levels of achievement of more-or-less similar knowledge and skills. Now, 
there would be no need for an order of merit and for comparability of 
evidence (a) were there no need to make selection and pass-fail decisions 
affecting an individual’s progress toward his or her goals in life, and (b) were 
there no expectation on the part of parents and society at large that assess- 
ments of how well a child is progressing in school would be provided in the 
normal course of events. Even recognizing these considerations, it seems 
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clear that comparability is not the end-all of achievement assessment. It is 
obviously desirable for students to develop in accordance with their different 
abilities and interests and personal goals. The question, when assessments of 
achievement are being made, is how to make fair comparisons among stu- 
dents who have attempted to achieve different things in school. 

Teachers collect a variety of information on the achievements of students. 
Some of this information is informal and not comparable from one student to 
another. Consider, for example, the assessments made by means of projects, 
where each student undertakes a project of his or her own choice. Other 
information may be comparable from student to student within the same 
classroom, for example, scores on a test written by all students under more- 
or-less standard conditions. What was called formal evaluation in Proposi- 
tion IV seems required for comparability of assessments across classes and 
schools. The results of formal evaluations can play a role in aligning the 
marking scales of informal evaluations, where different students are judged 
for their performances of different tasks. The difficult task for the teacher is 
to combine all the information, both formal and informal in the senses used 
here, into a composite that provides a basis for making fair comparisons of 
different students. The way in which all this information can or should be 
combined is a largely unresolved issue. 


Some Comments on the Papers and Some Thoughts 
About Research 


Six of the seven papers commissioned for this volume can be sorted into three 
loosely defined categories that relate directly to classroom assessment. The 
first category includes the reports of how student assessment is being done in 
British Columbia or Ontario or both. Here are the papers by David Bateson 
and Robert Wilson. These papers build on and otherwise tie into the results 
of other research conducted in Canada (e.g., McLean, 1985) and elsewhere. 
The second category includes the three papers in which procedures for chang- 
ing, if not revolutionizing, classroom assessment are proposed. These were 
prepared by Dan Bachor, Clement Dassa, and Leslie McLean. Alone in 
category three is the contribution of Todd Rogers. His is a paper about 
student assessment and educational policy at the provincial level, with the 
issue being that of how provincial policy affects classroom practice in the 
domain of student assessment. The remaining paper by Philip Nagy and 
Rosalyn Moorhead lies outside a focus on classrooms and addresses an issue 
in principal leadership, albeit an issue rooted in classroom assessment. The 
research reported by Nagy and Moorhead is remarkable for the creative 
approaches they devised for summarizing qualitative, verbal reports. 


Never Enough 


The studies reported by Bateson and Wilson employed very different method- 
ologies, very different samples, and were addressed to very different ques- 
tions. Although both were essentially descriptive studies, Wilson’s study 
differed from Bateson’s in a way worthy of notice: Underlying Wilson’s work 
was a theoretical framework, which informed the design of the study and the 
interpretation made of the results. 
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As interesting as descriptive research of the kinds reported by Bateson 
and Wilson is there never seems to be enough of it, or the right work seems 
not to have been done, to explain phenomena that arise in other contexts. For 
example, I read the Bateson and Wilson papers with the hope of finding clues 
to an explanation of the different rankings of British Columbia and Ontario 
13-year-olds in the [AEP study, A World of Differences (Lapointe, Mead, & 
Phillips, 1989). One hypothesis is that British Columbia students have had 
greater experience with multiple-choice science tests like that used in the 
IAEP study than students in Ontario. I tried to confirm this in Wilson’s 
report but was disappointed to find that science was not a subject of report in 
Wilson’s Ontario sample at the 7th and 8th grade levels: the grades, in 
Ontario at least, where the vast majority of 13-year-olds are found. Of course, 
Wilson’s study was not intended to yield widely generalizable results, so even 
if his study had included one or two 7th or 8th grade science classes, the 
evidence would have provided a far-from-conclusive test of the hypothesis. 
The foregoing example suggests a kind of research that may be needed: 
research into the effects of the mode of classroom evaluation on quality of 
student response to the tasks included in external assessments when those 
assessments employ, on the one hand, a mode frequently used in the class- 
room and, on the other hand, a mode infrequently used in the classroom. 
Many research questions of this kind could be posed, of course, so the matter 
of priorities must be addressed. I have no framework to offer for setting 
priorities. There is need for a thoroughgoing review of research on classroom 
assessment, with a view to constructing a framework on which to base 
priorities and funding decisions. 


Advocates of Change 


The papers by Bachor and McLean complement one another. Both call for 
changes in the way achievement is assessed in school classrooms. McLean 
sees the way ahead as that of collecting systematically the products that have 
resulted from student responses to “meaningful tasks.” As I understand the 
term, a meaningful task is one that both student and teacher agree is 
relevant to a specified educational objective or kind of achievement. The 
evaluation of a student’s response to a meaningful task would consist of a 
mark on a suitable scale of quality or merit. Bachor too, but for different 
reasons, sees merit in the idea of broadening the data base of classroom 
assessment. Bachor is addressing the situation in which students may not 
provide much evidence of achievement and are highly unlikely to be much 
interested in performing tasks on demand. In his scheme, the assessment 
data include observations of a sample of classroom performance, an analysis 
of a sample of work, a record of an interview of the student as he or she 
tackles a series of tasks, and a record of the student’s response to a trial 
lesson. 

The suggestions of Bachor and McLean are timely and appropriate. To 
assess a student’s achievement is to make an inference based on evidence 
(the performance of tasks). There is no good reason to limit the evidential 
basis on which that inference rests; indeed, the narrower the evidential basis 
underlying an assessment of student achievement, the less generalizable the 
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assessment will be to the performance of other tasks, whether more or less 
similar. Performance of paper-and-pencil tests is altogether too narrow a 
base for assessing student achievement in the classroom. Inherent in the 
recommendations of Bachor and McLean, however, are problems that must 
be addressed. Most serious, perhaps, is that implementation of these recom- 
mendations might result in situations where the evidence in the record for 
one student is very different from the evidence in the record for another 
student. Such a lack of comparability would mean that direct comparisons of 
the achievement of one student against that of another, as in performance of 
the same task, would not be possible. Were this the case, such comparisons as 
were made would be done subjectively by the teacher, using whatever norms 
he or she had formed on the basis of experience for the different tasks and 
whatever subjective equating he or she had made of the different tasks and 
norms. Research is needed to provide a better description than we now 
possess of how teachers, operating independently, attempt to keep their 
marks in line with those given by other teachers of other students in the same 
grade or course, if indeed they attempt to do so at all. In his presentation, 
McLean passed lightly over the issue of assigning grades to cumulative 
records of achievement. (In a personal communication, McLean indicates 
that he has dealt with this issue in other reports.) How might it be ensured 
that a mark of A (or B or C) means more or less the same thing when 
assigned to different records of achievement? A solution, however imperfect, 
would be to ensure that every student’s portfolio included several records of 
performance of exactly the same tasks. This information would provide a 
basis for adjusting the subjective and noncomparable scales that are used for 
assessing performances of tasks that are different from student to student. 
Time constraints on teachers being what they are, I imagine the written 
classroom test will continue to serve as the principal means of aligning the 
portfolio assessments of different students in the same class. In addition, of 
course, there is need either for the sharing of tests among teachers or for the 
occasional use of standardized tests so that teachers in different classrooms 
and schools have more or less comparable expectations of student achieve- 
ment. Fairness and equity, if not society, demand no less. 

A final point about the Bachor/McLean proposals: As desirable as a world 
without measurement error would be, such a world cannot be legislated into 
existence. The grades associated with cumulative records of achievement 
cannot be interpreted properly unless evidence of components of error vari- 
ance is systematically collected. Thus the McLean and Bachor proposals 
include the germ of a research agenda. High on this agenda is research to 
identify the salient dimensions of variation in teacher judgments of portfolios 
or cumulative records of achievement. This might be accomplished using the 
multidimensional scaling approach employed by Glass (1967) in another 
context. Another item on the agenda is the conduct of generalizability studies 
(Brennan, 1983) to assess the magnitude of the components of variance 
associated with the marks teachers assign student records. (An alternative 
approach, suggested by McLean in a personal communication, is to create, if 
possible, a theory of measurement that applies in situations where the data 
are not scores on some continuous scale but, instead, are assignments to one 
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of a few categories, these assignments being made under the conditions that 
the number of students is small, as in the typical classroom, and the assess- 
ments, that is the assignments to categories, are made frequently.) 

Dassa’s approach to reforming classroom assessment is very different 
from that recommended by Bachor and McLean. Disillusioned from the 
experience of prescribing a method of reform based on a particular model, in 
this case the mastery learning model of formative evaluation, Dassa has 
taken the approach of meeting with teachers and, as I understand it, working 
with them to identify and solve their assessment problems. The cornerstone 
of Dassa’s approach is to gain the trust of each teacher, thus to be in a 
position to bring his or her expertise to bear on their problems. Dassa’s goal, 
it seems, is to bring the assessment approaches of different teachers as 
closely into line with a formative evaluation model as possible. This approach 
is reminiscent of that taken by Sarason (1971) and his colleagues two de- 
cades ago as they endeavored to change practice in a school near New Haven, 
Connecticut. In the course of his work, Dassa expects to define a “research 
paradigm centered on the classroom.” I can only wish him well in this pursuit 
and eagerly look forward to reading an evaluation of its success. 


Policy Considerations 


Rogers’ paper gives us an excellent account of power politics as played in the 
assessment arena. Clearly, governments that see a need for increased exter- 
nal surveillance of student achievement, and that have public support for 
such surveillance, will introduce policies and associated practices that are 
likely to affect classroom assessment of student achievement. What is needed 
in this connection are studies of assessment policies and practices at all levels 
from the province (state) to the school board to the school to the classroom 
(teacher), both before and after public policy and practice have been changed. 
This will make it possible to associate changes at the different levels with 
changes at the provincial (state) level. The resulting information will en- 
lighten any debate as to the possible effects of provincial policy and practice 
on the day-to-day experiences of students. It is gratifying to know that 
British Columbia is funding studies of the possible effects on students, teach- 
ers, and teaching practices of the Grade 12 examination program and the 
British Columbia assessment program. It is to be hoped that government 
officials in other jurisdictions will see the need for, and fund, this kind of 
research. Although not a research agendum per se, members of the measure- 
ment community should press for such studies. 


References 


Brennan, R.L. (1983). Elements of generalizability theory. lowa City: American College 


Testing Program. 
Glass, G.V. (1967). Factors in teachers’ perceptions of students. Journal of Educational 


Measurement, 4(2), 87-93. 
Lapointe, A.E., Mead, N.A., & Phillips, G.W. (1989). A world of differences, Princeton, NJ: 


Educational Testing Service. 
McLean, L.D. (1985). The craft of student evaluation in Canada. Toronto: Canadian 


Education Association. 
Sarason, S.B. (1971). The culture of the school and the problem of change. Boston: Allyn & 


Bacon. 


91 


The Alberta Journal of Educational Research Vol. XXXVI, No. 1, March 1990, 92-97 


RICHARD J. STIGGINS 


Northwest Regional Educational Laboratory, Portland, Oregon 


Toward a Relevant Classroom 
Assessment Research Agenda 


This commentary underlies the need to pursue measurement research in the classroom 
setting. We need to describe assessment processes as they occur and understand their 
effects. From the papers which support the new research agenda, implications for practice 
are drawn. 


Considered collectively, the papers presented at this conference provide a 
broad perspective on the state of classroom assessment research, develop- 
ment, and training. As I read the papers, heard the presentations, and 
participated in discussions, I tried to identify emerging themes in this grow- 
ing field of study. In my summary, I would like to share some of my thoughts 
regarding those themes, focusing on the goals or objectives that appear to be 
guiding our research, the research methods we are using, some of the results 
being attained, and emerging implications for future work. 


Research Goals and Objectives 


Clearly as a community of scholars and practitioners, we have made a 
complete transition from caring totally about standardized tests to caring far 
more about better classroom assessments and better classroom-level decision 
making. These papers attest to this fact. 

Further, it is clear and appropriate that our most important research 
objective has been to describe the status of classroom assessment methods. 
We are describing many aspects of that status, including the various assess- 
ment methods teachers use across different student and subject matter 
contexts. We are documenting elementary and secondary assessment dif- 
ferences, as well as the unique dimensions of assessment across such content 
areas as mathematics, writing, speaking, and science. Papers by Bateson, 
Wilson, and Bachor represent examples of such descriptions. Through this 
work, we are showing that assessment is not assessment. Rather, as the 
nature of the achievement target varies across grades and subjects, so do the 
appropriate assessment methods. As a result, we now know that teachers 
face complex task demands as they strive to track the achievement of their 
students. 

Further, in our research at the Northwest Regional Educational Labora- 
tory (NWREL), we have joined forces with those striving to document the 
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nature and quality of the training and technical assistance provided to 
teachers to help them meet those task demands. Documentary evidence is 
now in hand to corroborate what we have always sensed but rarely verbal- 
ized: that training is totally inadequate if it is presented at all, and technical 
backup in assessment is virtually nonexistent for teachers. 

As we set our objectives for the future, we should continue to describe the 
nature and quality of classroom assessment processes in as much rich detail 
as we can. But in addition, let us follow the path spelled out by Dassa in his 
paper and strive to understand the effects or implications of the various 
assessment processes teachers use on student development. In my opinion, 
this represents one of the next two critical classroom assessment research 
frontiers: We must strive to understand how our assessments impact stu- 
dents, who are, after all, the most important decision makers relying on the 
results of those assessments. Students set expectations and decide how to 
relate to school on the basis of data provided to them by their teachers. If they 
receive sound data, they make sound decisions. If they receive unsound data, 
they make unsound decisions. They and they alone are the beneficiaries or 
victims of classroom assessment. Yet we know so little about how they 
regard, evaluate, and act on the data they receive. 

The second great research frontier we face is that of teacher and ad- 
ministrator training in assessment. How can such a critical component in the 
professional preparation of teachers have been so completely ignored or 
mismanaged? Researchers can investigate and understand the reasons for 
the lack of relevant training. We can find ways to develop and deliver 
relevant training. We understand the meaning of quality assessment. We can 
demystify our technical concepts. The standards of acceptable training are 
not mysterious. Our research task is to find meaningful ways to communi- 
cate those standards to educators who must strive to meet them. 

In sum, these papers confirm that we are aspiring to appropriate goals, as 
we continue to describe the complexities of classroom assessment processes 
and environments. In addition, we must work to understand the critical 
effects of assessment on students and we must begin to sort out effective 
strategies for rendering assessment training relevant and helpful for prac- 
titioners. 


Research Methods 


The research methods presented in these papers are important because they 
represent promising departures from the traditional array of methods used 
in psychometric research. The idea, for example, of presenting educators 
with a specific assessment problem and observing how they attack it, as Nagy 
and Moorhead have done, opens a range of possibilities for describing and 
understanding the current state of classroom assessment. The examination 
of special education assessment methods in terms of their implications for 
the general student population, as Bachor has done, demonstrates a kind of 
professional reflection that can enhance the quality of classroom assessment. 
And finally, the experimental method demonstrated by Dassa reminds us of 
the continuing value of effects research. 
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All of these research methods hold the promise of expanding our wisdom 
in the arena of classroom assessment. At NWREL, we have relied on all of 
these methods to build our knowledge base. In fact, we feel we experienced a 
major breakthrough in our understanding of the task demands of classroom 
assessment when we became ethnographers and entered classrooms as par- 
ticipant observers with the goal of understanding the assessment experience 
from teacher and student perspectives. As we pursue our goal of continuing 
to describe and understand day-to-day assessment processes, we continue to 
rely on this methodology. Our next research goal is to find cases of success— 
outstanding classroom assessment environments—so we can contrast them 
to less successful classroom assessment environments. 

As we look to the future, I believe we attain maximum success in under- 
standing the nature and quality of classroom assessment if we rely on 
research methods that emphasize depth of information more than generaliz- 
ability of results. That means small samples viewed with powerful micro- 
scopes. However, as we strive to find ways to enhance the quality of 
assessment, we might return to more classical research paradigms. We can 
evaluate various training methods via experimental and quasi-experimental 
approaches to find those that have maximum impact. As we conduct these 
studies, we must examine the impact of training on several dependent vari- 
ables: trainee knowledge of assessment practice, implementation of sound 
practice, confidence in assessment results, and impact on student learning. 


Research Results 


Our various research results are coalescing around consistent and important 
themes: (a) the day-to-day assessment of student achievement in many 
classrooms is complex and demanding and (b) is not producing the high- 
quality data for decision making that we would like, because (c) those teach- 
ers and administrators who serve in the front lines of education have not 
been provided basic training in classroom assessment methodology. We know 
what assessment methods teachers use and the extent of their use. We know 
that teachers rely on far more than objective paper-and-pencil tests. We 
understand many of the assessment quality control problems that arise in 
classrooms and we know about the frequency of their occurrence. As McLean 
has told us in his paper, we know a great deal about the “craft knowledge” 
that exists in classrooms about the assessment process. Further, we know 
that that craft knowledge does not reflect the sense of assessment captured 
in the traditional standardized test. 

In short, we understand a great deal about classroom assessment, even if 
we have a great deal yet to learn. These papers suggest that we should be 
encouraged by a decade of research that has taken us this far. The next 
decade will carry us far beyond. We know the quality of classroom assess- 
ment can improve. 


Implications for the Future 


As we work for that improvement, here are some key perspectives we need to 
keep in mind: 
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1. Policy makers will play a critical role in improving classroom assess- 
ment. The assessment community knows that teachers and administrators 
are in desperate need of assessment training; teachers and administrators 
know they need assessment training, but policy makers often are not aware 
of this gap in professional preparation. As a result, they make policy, as 
Rogers reveals in his paper, on the assumption that practitioners can handle 
all relevant assessment issues. For example, they approve plans for a prov- 
ince-wide ungraded primary on the assumption that teachers are prepared to 
handle the considerable assessment challenge that accompanies the need to 
individualize instruction. If this assumption is invalid (and we know it is for 
many teachers), such a policy may result in a program doomed to fail before 
it begins. 

Policy makers need to (a) become literate in the basic principles of sound 
assessment themselves, and (b) become aware of inadequate assessment 
training provided to the educators who will implement their policies. The 
assessment community must take the lead in training policy makers and 
sensitizing them to these issues. 

2. Classroom assessment is a domain of professional practice that invol- 
ves a wide variety of assessment methods. Many in our society believe that 
most, if not all, of our valued educational outcomes can be translated into 
multiple-choice test items. We must help them overcome this misunderstand- 
ing. 

As educators, we help students to hit many different kinds of achievement 
targets: content, thinking skills, behaviors, and products. Some targets can 
be translated into objective test items; others cannot. Those who educate and 
those who view education as interested outsiders need to understand that the 
assessment of the full range of valued outcomes requires the full range of 
assessment tools: paper-and-pencil tests, performance assessments (observa- 
tion of and judgment about behaviors and products), and personal com- 
munication with students. Each represents a valuable assessment tool when 
used well in the proper decision context. All must be made aware of this fact. 

3. In the past, practicing educators have either not been motivated or not 
been given the opportunity to learn about assessment methodology. This is no 
longer acceptable. We need to conduct the research needed to understand 
how to motivate (a) teacher training institutions to provide relevant assess- 
ment training, and (b) teachers and administrators to take advantage of 
training opportunities. We have at least two strategies available for reaching 
these goals: force or finesse. We can force participation in training through 
power of law. Teacher training institutions can be made to adhere to revised 
standards of acceptable assessment training. Teachers can be made to par- 
ticipate in training through revised program requirements. They can be 
made to pass rigorous certification tests revised to cover assessment method- 
ology. 

Or we can make available to educators relevant and helpful training and 
encourage participation to grow through spreading interest in new approach- 
es to classroom assessment. In this case, all training would be offered and 
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taken on a voluntary basis. However, it would be made to be very attractive 
training. 

We have in hand both the background information and teaching strate- 
gies to provide relevant, helpful assessment training. We must now research 
the best way to motivate educators (and noneducators alike) to take ad- 
vantage of that training. If finesse will work, it is the best option. But if 
finesse is insufficient, the force of law may be needed. 

4. Traditional measurement courses, per se, won’t meet the training needs 
of all teachers or all contexts. We need alternative modes for presenting 
training. We need training strategies and modes of presentation that fit both 
the preservice training context and the huge inservice training need. Written 
training materials, video training, audiocassettes, and other available in- 
structional technology can be used profitably to communicate keys to effec- 
tive assessment using both extended presentations and more efficient brief 
contacts. We must explore all such options. 

5. Existing technology promises to revolutionize the gathering and use of 
information in the classroom, if we learn to apply it. Research is needed on 
strategies for using optical scanners, database management systems, word 
processing, gradebook software, video presentations of information, com- 
puter item banks, and other advanced technology to ease the information 
management load on teachers and administrators. The technology exists; we 
must learn to use it to advantage. If we wish to “sell” sound classroom 
assessment to teachers as a process that can make their jobs faster, easier, 
and better in that order, the use of available technology is an excellent way to 
begin. 

However, we must take great care not to offer technology as a substitute 
for understanding the essentials of sound assessment. For instance, we 
cannot profitably offer banks of previously-developed test items to teachers 
as a substitute for teaching the users how to write and select good test items. 

Without the basic assessment training, they will not know how to use the 
item bank to maximum advantage. The bank will come to represent just 
another variation on the same old test construction hassle and they will not 
use it. Only those who understand sound assessment can use advanced 
technology to improve assessment quality and efficiency. 

6. Excellent research is being conducted on classroom assessment issues, 
yet it is not being published and disseminated for critical review, evaluation, 
and synthesis. We must develop more and better ways to include reports of 
classroom assessment research in scholarly journals. The Surrey, British 
Columbia School District’s journal Research Forum (a journal devoted to 
educational practice and theory) represents an excellent example of one 
attempt to find new ways to communicate. Other districts and professional 
associations might follow this lead. In addition, research on classroom assess- 
ment issues, with its many and varied research methods, needs to be elevated 
to an appropriate level of acceptability for university faculty tenure and 
promotion evaluations and for dissertation and thesis research in graduate 
programs. Through these steps and others like them we can both encourage 
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and spread word of important classroom assessment research and develop- 
ment. 


Conclusion 


The lessons of history tell us that when assessment research, development, 
and training agendas mix topics related to large-scale, high-stakes tests with 
topics related to classroom assessment, the much less visible and less politi- 
cally important classroom assessment issues almost always lose. High-stakes 
tests are like sponges, drinking up the resources. 

The time has come to separate the two. They represent different dis- 
ciplines, resting on different underlying assumptions and paradigms. They 
serve different decisions and different decision makers. The methods of 
classroom assessment are more diverse, complex, and demanding than those 
of high-stakes testing. The time has come to separate them. The effect of 
doing so has been demonstrated by this challenging, stimulating, and excit- 
ing conference. The focus was narrowed. We discussed not the improvement 
of all of school testing, but the improvement of classroom assessment. The 
result is a conference that suggests specific action. Let us begin. 
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On the Graying of Academe 


Recently the graduate students in our department were asked to assess their 
doctoral programs. Among their comments was the observation that “faculty 
members should display interest in conducting research, and should be more 
actively involved in research in order to act as mentors for students (rather 
than publishing ‘opinion’ articles and textbooks).” Both as an editor and asa 
member of a 40-member department, I find that this observation is not far 
off the mark. Why is it, how did it come to pass, and what if anything should 
be done about it? 

There is no doubt that our faculties of education are aging. Most of us 
arrived in the late ’60s and early ’70s full of energy and good ideas, fired by 
recently completed doctoral programs and ready to create the new era in 
education. We could teach our classes, guide several graduate students, raise 
families, and do research. Those were heady times. Then slowly we got older; 
the pressures changed; we accepted more responsibility in the administration 
of our institutions and their programs; there didn’t seem to be as much help 
around; we didn’t have as much time to read and keep up. So, like squirrels 
in winter, we began to live off the seeds of our stored experience. Put in its 
best light, we accepted the life of the sage. This would have been fine except 
that there was no one coming behind us to keep our earlier activity alive. New 
people were not being hired so the character of the community became more 
reflective. 

Of course there were significant exceptions to this, none of whom I will 
name for fear of missing some, but my experience on grant adjudication 
committees, attendance at conferences, and work on the editorial boards of 
two major Canadian journals echoes the students’ claim that the preference 
is to write about research rather than to do it. 

Should something be done about it? In some respects it is part of the 
natural process of aging that prompts us to be more reflective, to become 
more philosophical about the passing stream of educational thought. More- 
over, the rise and fall of central or key ideas which seemed so promising in 
their time but yielded little in the long run encourages cynicism among 
graying researchers. (Whatever happened to open classes, direct to indirect 
teaching ratios, advanced organizers, cross-pattern crawling, in-basket tests, 
differentiated staffing?) 

Recent issues of University Affairs and the CAUT Bulletin warn me that 
things are about to change. Where once I could quickly read from economics 
to engineering without finding an interesting position for the graduating 
educational researcher, now I see many opportunities. The influx of these 
new people into our faculties of education is going to provide the energy and 
creativity that will stimulate a resurgence of research. 
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I don’t want to leave the impression that the retreat into reflection that 
characterizes my generation is without merit. We are producing a change in 
the nature of educational research. Whereas 20 years ago many of us viewed 
research as being about education with the goal of informing practice, now 
with the wisdom of experience we encourage our graduate students to make 
research part of education by acknowledging the reciprocal relation between 
research and practice. 


T.0O.M. 
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The Image of Community! 


Teachers develop and use a special kind of knowledge in their teaching which has 
been defined as personal practical knowledge. Clandinin (1985) proposes a con- 
ception of personal practical knowledge as one in which theoretical and practical 
knowledge are blended by the personal background and characteristics of the 
teacher and experience in particular situations. The concept of “image” is one 
form of personal practical knowledge (Clandinin, 1985). In this article I describe 
how an image embodies the personal and professional experiences and is ex- 
pressed in classroom practices by two interns who participated in Alberta’s Initia- 
tion to Teaching Project implemented in 1985. Data were collected through field 
notes recorded on site during the visits to classrooms, notes collected and recorded 
during planning activities and other activities in which the intern participated, 
and interviews held with each intern throughout the time of the study from 
September 15, 1986 to June 15, 1987. What this study suggests is that not only do 
images function in the lives of classroom teachers, but that an image such as 
“classroom as community” can be played out differently in classroom practices 
and teacher talk. 


Introduction 

There has been a recent upsurge of interest in research on teacher thinking 
(Clark & Peterson, 1986; Clark & Yinger, 1977). One focus in research on 
teacher thinking has been on the special kind of knowledge teachers develop 
and use in their classroom practice. This knowledge has been referred to as 
“practical knowledge” (Elbaz, 1981) or “personal practical knowledge” 
(Clandinin, 1985). These researchers assume that teachers know, use, and 
develop personal practical knowledge in their teaching. 

The idea of “image” is one form of personal practical knowledge, the 
name given to this special knowledge of teachers (Clandinin, 1985; Elbaz, 
1981). Images act as the coalescence to combine and form a whole of all the 
experiences within a teacher. An image has an emotional and moral coloring 
and links a person’s educational life, professional life, and his or her personal, 
private life. 

The research reported here furthers ongoing lines of research by focusing 
on how teacher practical knowledge is developed and expressed in an intern- 
ship experience. 


Kathy Skau is head of the Department of Teacher Education and Supervision in the Faculty 
of Education. Her teaching and research interests center on teaching, collaborative research, 
and teacher education. 
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Statement of the Problem 

Ongoing research (Clandinin, 1983, 1985, 1986; Connelly & Clandinin, 1985, 
1986; Elbaz, 1981) offers constructs of teacher practical knowledge as it is 
expressed in classroom and school situations. This study builds on ongoing 
research by focusing on how teachers’ practical knowledge is developed and 
expressed in the teaching assignments as experienced by two interns. The 
research purposes for the study were (a) to describe the personal practical 
knowledge of two interns in the Initiation to Teaching Project, and (b) to 
conceptualize the development of personal practical knowledge in the in- 
terns. 


Review of Related Literature 

During the last decade, some researchers who have been studying teaching 
have focused on teacher thinking processes (Clark & Peterson, 1986; Clark & 
Yinger, 1977, 1979; Peterson & Clark, 1978) and the development of teacher 
practical knowledge (Clandinin, 1985, 1986; Elbaz, 1981). In order to set this 
study into the context of the above mentioned research, a brief review of 
several key concepts is presented. The key concepts to be reviewed are 
personal practical knowledge, expressed as teachers’ classroom images, and 
the narrative method of inquiry. 


Concept of Personal Practical Knowledge 


The concept of personal practical knowledge embodies a dialectical view of 
theory and practice (Clandinin, 1983, 1985). In a dialectical view theory and 
practice are seen to be inseparable; practice is seen as theory in action and 
theories are modified and changed according to what occurs in the practical 
world of the classroom. 

The concept of personal practical knowledge is discussed in detail else- 
where (Clandinin, 1985, 1986; Connelly & Clandinin, 1985). The following is 
a brief summary as stated by Clandinin (1985). 


The “personal” is that knowledge which participates in, and is imbued with, 
all that goes to make up an individual. The personal aspect of the knowledge 
includes the experiences and knowledge each teacher holds which has arisen 
from the circumstances, actions, reactions, perceptions and interpretations 
that the person “knows.” This personal knowledge can be discovered in both 


the actions of the person and under some circumstances, by conversation. (p. 
362) 


By knowledge in the phrase personal practical knowledge is meant: 


that body of convictions, conscious or unconscious, which have arisen from 
experience (intimate, social and traditional) and which are expressed in a 
person’s actions. The actions in question are all those acts that make up the 
practice of teaching, including planning and evaluation. (p. 362) 


The “practical” in knowledge comes from the work of Elbaz (1981), whose 
conception of practical knowledge is a broad definition incorporating three 
terms: rule of thumb, practical principle, and image. Of interest to this study 
is the concept of image defined by Elbaz as the level at which the teacher’s 
feelings, values, needs, and beliefs combine to form brief metaphoric state- 
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ments of how teaching should be and marshals experience, theoretical know- 
ledge, and school folklore to give substance to these images. Clandinin (19838, 
1985) expands the concept of image to encompass the knowledge which 
resides at the nexus of the theoretical, the practical, the subjective, and the 
objective in a teacher’s life. An image links a person’s educational life, 
professional life, and his or her personal, private life and is permeated with 
emotional and moral coloring, in part derived from the experiences from 
which it originates. In this study the image of “classroom as community” 
becomes a magnet around which significant elements of experience were 
attracted and shaped in the lives of two interns. 


Narrative Method of Inquiry 


The narrative method of inquiry in the study of teaching attempts the 
reconstruction of classroom meaning in terms of narrative unities in the lives 
of classroom teachers. The narrative has to do with telling stories, in this case 
telling stories about ourselves as teachers. What is meant by unity is the 
“union in a particular person, in a particular place and time of all the person 
has been, and undergone in the past” which results in a “meaning-giving 
account and interpretation or story which focuses on the ways classroom 
actions are meaningful to the individual” (Connelly & Clandinin, 1986, p. 
15). The narrative unity focuses on the specific, concrete events in a teacher’s 
life which have been meaningful in that person’s teaching history and have 
provided the teacher with an interpretation and understanding of his or her 
classroom actions. The data, collected over an extended period of time, tend 
to take on the form of a story which focuses on the ways classroom actions 
are meaningful to a teacher or, in this study, to an intern. 


Research Design and Method 
The research was conceptualized in terms of a particular notion of the 
relationship between theory and practice referred to as a “dialectical” (Mc- 
Keon, 1952) relationship between theory and practice and between research- 
er and intern. Rather than a dichotomy or separation, the theory and practice 
inform one another and similarly the researcher and interns shared mean- 
ings about the classroom milieu. 

The intent of the dialectical research perspective is to come to understand 
how individual teachers understand their teaching in their own teaching 
situations (Janesick, 1982; Nespor, 1986). The research methods employed 
bring together the interest and purposes of both the teacher and the re- 
searcher (Elbaz, 1981). Teachers (interns) are viewed as persons who create 
meaning and value for the teaching actions in their own situation. The 
principal research activities are described under the following subheadings. 


Negotiation of Entry 

In the case of the first intern Ms. M,? initial discussions were held regarding 
participation in the research project in the late summer of 1985 with person- 
nel in the superintendent’s office of a school board. A written proposal was 
submitted, followed by a planning meeting with school board personnel. A 
school which had sukmitted a proposal under the Initiation to Teaching 
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Project for an intern to begin work September 1, 1985 was chosen as the 
research site. The principal, school staff, and the intern were willing to be 
participants in the study. 

The intern, a stranger to the researcher, showed some hesitancy in the 
initial stages of the research. She appeared shy and uncertain as to what 
expectations were held for participation in the research study. Time and 
effort were taken to establish rapport and negotiate the methods of research 
to be conducted. After the first visits and interviews, the typescript of both 
data sources were shared with the intern on a regular basis. Her responses 
and reactions to the data were included by the researcher in discussions 
during subsequent visits and interviews. 

In the second year of the study (1986-1987), entry into the school was 
negotiated directly with the intern Mr. K and confirmed by the principal, the 
research officer, and superintendent at the school board level. This intern 
had worked as the researcher’s assistant in a project during the summer, so 
that rapport was well established when data collection began in September 
1986. Again, typescripts of field notes and interviews were shared on a 
regular basis with the intern. Responses were acted upon in later discussions. 


The Interns 


Two interns, Ms. M and Mr. K, participated in the two-year study. Ms. M was 
a secondary school teacher with a specialization in English. During the 
research period (September 26, 1985 through May 15, 1986), she taught 
mathematics, social studies, science, and language arts to a variety of classes 
in grades 7 through 9. 

Rather than being assigned to a single supervising teacher, Ms. M worked 
closely with the teacher who was the specialist in the subject she had been 
assigned to teach. Moves from subject to subject and grade to grade occurred 
at regular two-month intervals, with the exception of several language arts 
classes to which she was assigned for the full year. The supervising teachers 
had established different classroom policies and thus the intern found herself 
working to establish relationships with several supervisors and their classes. 
In time Ms. M established herself as one of the team with her own classroom 
policies which students came to know and respect. A strong central school 
policy reflecting a common religious belief system helped make the intern’s 
and supervising teachers’ philosophies related to learning and teaching more 
congruent than might otherwise have been the case. 

Mr. K was an elementary route teacher with a specialization in social 
studies. During the research period (September 15, 1986 to June 15, 1987), 
he taught a split grade 4 and 5 with most of his time spent with the grade 5 
students. This intern taught physical education, mathematics, science, social 
studies, and language arts. 

Mr. K spent the entire year of his internship in the grade 4/5 classroom 
with one teacher. The teacher and intern established a close working rela- 
tionship, and as the year progressed Mr. K. gradually took over the class 
while the supervising teacher worked individually with those students who 
needed special attention. I observed many instances of collaboration: plan- 
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ning of lessons, planning units, selecting curriculum resources, and prepara- 
tions for parent-teacher interviews with the accompanying report card com- 
ments. Although not bound by a common religious philosophy, the intern and 
supervising teacher had clarified their philosophies regarding teaching and 
learning through a mutual sharing of ideas and competencies over an ex- 
tended period. 


Data Collection and Analysis 


The researcher employed a variety of methods to aid in the work of collecting 
data and conceptualizing the development of practical knowledge. First, field 
notes were recorded on paper on site and later dictated onto tapes to be typed 
as a series of ongoing entries. These notes were descriptions of the classroom 
sessions observed by the researcher during the half-day visits to the class- 
room(s) where the intern was teaching. No predetermined foci for the field 
notes were consciously used in the attempt to create a meaning-giving ac- 
count of the classroom events in collaboration with the intern. Notes were 
also kept during feedback interviews (intern responding to field notes col- 
lected by the researcher), planning sessions, or other activities in which the 
intern was engaged during the observation period. 

Second, interviews were a major data source for this study. The questions 
posed by the researcher during the interviews were formulated to have the 
intern further elaborate, extend, and interpret events recorded in the field 
notes and statements made in previous interviews. 

At the end of the study, field note data and interview data were read and 
reread. Notes were kept on words and/or phrases expressing ideas around 
themes or concepts which appeared regularly in the data. The field note and 
interview data were divided into content units under headings such as re- 
sponsibility, “minded” practice, validation, and territoriality. A further prob- 
ing of the raw data from the field notes and interview transcripts and a 
perusal of descriptive phrases identified as important to each intern led to 
the identification of practices and words which seemed to come together into 
what appeared to be images which each intern held. One image which 
appeared to act as the coalescence or magnet around which significant 
elements of experience were attracted in the lives of both interns was that of 
community. 

A narrative account summarizing the highlights and significant events 
and statements from each visit and interview was prepared for each intern. 
These accounts, up to 35 pages in length, not only provided feedback on the 
researcher’s interpretation of the observations and interview transcripts, but 
also advanced tentative ideas about the images each intern held. The ac- 
counts were mailed to each intern with the invitation to respond to the 
tentative interpretations and further clarify their perspectives. The interns 
responded to the narrative accounts and their responses were blended into 
this account. 
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Findings and Discussion 

Findings 

One image that appeared to be particularly pervasive and important to both 
the interns’ personal practical knowledge was a sense of community. To 
describe this as one image is probably misleading, for it is probably composed 
of a number of separate images. The image “community” appeared to be 
rooted in different experiential origins for each intern. At the level of practice 
similar goals were apparent as the images were expressed in words and 
actions. 

For the purposes of this study, the image of community speaks of a group 
of people bounded in space and time with common interests and values. The 
individual in the group is expected to share, communicate, and participate 
with others in ways understood and acceptable to the group. Members of a 
community share ideals which help them live together in an environment of 
harmony and well-being. For Ms. M the overarching ideals she advocated 
were those of caring and nurturing (Noddings, 1984). Mr. K was more inter- 
ested in establishing a democratic-style classroom where equal participation 
and getting along with others in the group were highly favored ideals. 

In the interviews no mention of community appeared, nor did the concept 
appear in the researcher’s field notes. Rather, the thoughts and aspirations 
expressed by the interns in the interviews, as well as actions taken in the 
classrooms, led the researcher to propose that an image of classroom as 
community might be held by the interns as part of their personal practical 
knowledge. As Schén (1983) states, personal practical knowledge is “know- 
ing-in-action,” tacit and often unarticulated. 


Ms. M 


Ms. M never directly talked about community, nor did the actual word appear 
in the field notes which described her actions in the classroom. Rather, the 
thoughts expressed in interviews, as well as actions taken in the classes, led 
the researcher to speculate that the concept of community was an image 
implicitly held by Ms. M as part of her personal practical knowledge. 

An exchange in one of the initial interviews began to suggest an image of 
community. 


I: The three boys slammed in; you made them go back and come in with a 
little more dignity. What was going on in your mind when you did this? You 
immediately responded and you came up with this tactic from your own 
background experience. 


Ms. M: Really, it is more personal than that—like that is not the way you 
enter into anywhere. Like you know, you walk in like civilized people and 
that’s the way it has to be done even if you are at school ... you know, where 
you are with your friends and you are supposed to have fun and things. I 
think it is my own background—like there is a way of doing things and let’s 
carry that over into all aspects of life. (Interview October 9, 1985) 


In the interviews and actions in the classroom Ms. M advocated a strong 
sense of appropriate behavior and norms regarding “civilized” behavior in 


106 


The Image of Community 


her classroom. A sense of being “in” or “out” of—of belonging or being 
banished from—the community if unwilling or unable to follow the norms of 
behavior was observed and discussed on numerous occasions. The following 
instance is an illustration of this. A disciplinary measure, implemented by 
Ms. M in the case of several disruptive students, was the use of banishment 
from the classroom. In her words, “I said starting tomorrow, you will find a 
desk in the hallway. You come in and listen to the instructions and then you 
are out in the hallway” (Interview October 30, 1985). The researcher ob- 
served this action in practice and noted that the disruptive students became 
“nobody” to Ms. M and ceased to exist as part of the community until 
students approached her on an individual basis to be reinstated into the 
classroom. 

Ms. M displayed a continual respect and concern for her students. She 
insisted on standards of communication, indicating politeness and considera- 
tion for one another. As stated, “I refuse to let my kids tell each other to shut 
up ... they are quite prone to giving each other put-downs. They got a lecture 
on that from me” (Interview March 7, 1986). This notion was elaborated: 


If I am asking them to be quiet and there are still a couple of people who are 
talking and a couple of people doing things that I just asked everyone else to 
stop, and everyone else has stopped except these two or three, then I say—like 
it is funny—“I don’t think that there is anyone in this class who is superior to 
anyone else, but obviously a few of you do and I don’t understand why.” When 
they realize, yes, 1am not any more special than anyone else in this class, they 
stop talking. (Interview August 7, 1986) 


Concerns for student privacy, a sensitivity to their feelings, and a need for 
being perceived to be “OK” were noted on several occasions. In a note-pass- 
ing incident, Ms. M informed her students: “I have respected your privacy in 
that I haven’t read a note and thrown it in the garbage. From now on, any 
note I catch being passed is going to be read out loud. I know now that the 
next note that I catch I am going to have to read.” The moral dimension to 
the image of community could be heard as she continued, “But if they are 
talking about other students, I don’t think that’s quite appropriate to read 
out loud.” 

A particularly stressful incident occurred when Ms. M caught a student 
cheating during an examination. Ms. M chose to deal with the matter pri- 
vately by speaking with the student, informing the principal and parents, and 
leaving the responsibility to the student to decide on the consequences and 
make amends. The student confessed her indiscretion and returned to class 
the next day. The intern proudly informed the interviewer, 


It was difficult to face this student the next day. The other kids, though, 
didn’t treat her any differently. She hasn’t become withdrawn or depressed. 
We tried to handle it internally. We didn’t get her on the defensive by saying, 
I caught you cheating, nor did we accuse her. The parents apologized for her 
behavior and let’s hope it doesn’t happen again. 


Again, the image of community appears to be implied in the way the 
intern resolves the incident and understands the event. The student returns 
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to the safe arms of the community after acknowledgment of stepping outside 
the acceptable boundaries of behavior held for the classroom community. The 
student continues to belong and act as a participating member of the com- 
munity. 


Mr. K 


Early in the interview in which the nature of the classroom was discussed, 
Mr. K summed up his position by stating: “I want a democratic-style class- 
room; one in which ‘one would get as much student input as possible,’ 
everyone has an even break in what they are trying to do” (Interview October 
29, 1986); “where getting along with each other is important” (Interview 
November 3, 1986); and a “friendly-type atmosphere prevails” (Interview 
October 15, 1986). 

A critical attribute of the community Mr. K was attempting to fashion 
had to do with thinking, asking questions, and “learning to question author- 
ity.” This theme appeared in the interviews on numerous occasions. “I 
encourage them to keep questioning—like the authority thing. That’s what 
I’d like to do. At the same time, they have to accept it [authority] and it’s very 
hard. Maybe I’m trying to do too much at once, but I’m just working with it 
now. I get this chance to practice” (Interview October 15, 1986). 

This image of community found expression throughout the study as is 
evidenced in the following excerpts. 


I continue to ask probing questions, just to get a little more information and 
so they’d [students] stop and think ... I’m trying to help students question 
authority ... I mean in a subtle sense in that they would question things 
within their own minds. They know their roles, but at the same time there are 
some things they should question ... if there’s something that seems wrong to 
them, well then, they can voice that. (Interview October 22, 1986) 


I want children to have national pride as Canadians. I want them to question 
authority. Books aren’t always right and encyclopedias may be out of date a 
bit. They [the students] can begin to pick up on some of these things. I also 
encourage them to pick out inconsistencies in what they hear. (Interview 
June 15, 1987) 


The field notes provided the evidence that the beliefs espoused by Mr. K 
were lived out in action. The use of questioning as an instructional strategy, 
the habit of answering a student’s question with another question, the en- 
couragement of thinking about an issue from various perspectives were 
actions recorded in the observations during the 10-month reporting period. 


Discussion 


The construct of the image of community was used as a conceptual device to 
give an account of the practices and distinctiveness of each intern’s experi- 
ence in the first year of teaching. Various dimensions make up the construct 
of image. Aspects of image include the moral dimension, the emotional 
dimension, the private and educational dimension, and the origin of the 
image in the coalescence of diverse experience. The conceptualization of the 
image of classroom as community is presented from two different experi- 
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ential origins. One perspective appears to combine and form a whole from a 
set of deep religious beliefs, while the second image coalesced from the ideals 
formed in a political context. 

The construct of image links together, both in its origin and in the 
function it serves, an individual’s personal, private, and professional educa- 
tional experience. The image of community has its origins in the intern’s 
private experiences and understanding of what his or her own community is. 
The image was expressed in how the classrooms were organized, how stu- 
dents were encouraged to relate to one another, how discipline measures 
were exercised, and how the interns related to the students themselves. 

Ms. M’s classroom was set up in what might be termed an authoritarian 
style in which she presented the class with a seating plan. The enforced 
seating arrangement came about as part of the consequences meted out to 
students who would not cooperate and settle down to work. As well, they lost 
privileges including washroom breaks, getting materials from hallway lock- 
ers, and borrowing from one another. They had “only themselves to blame for 
what had happened.” In contrast, Mr. K had students decide how they might 
pick their seats. They decided on the use of random numbers as being the 
fairest way to assign seats in a large U-shaped formation. 

In both classrooms students were encouraged to work together. In Ms. 
M’s community, the relationship was based on respect and standards of 
communication which reflected a politeness and consideration for each other. 
Students were discouraged in the use of put-downs, rude language, or hurtful 
teasing and joking. Mr. K emphasized participation and involvement in ac- 
tivities by all students. He worked hard to create a friendly atmosphere and 
stressed the need to cooperate and get along in the classroom discussions, 
projects, group activities, and the game situations employed in the physical 
education classes. He wished for all students to be perceived as equal and 
expressed some annoyance when individuals did not wish to participate with 
particular others on a task or class project. 

Several drastic disciplinary measures were undertaken by Ms. M when 
several students continued to be disruptive. The intern dealt with them by 
informing them of their loss of privileges and becoming “nobody.” As she 
described it, “I tell them, you listen to instructions and you do not participate 
in any activities that we are doing. Once you have your instructions, you go 
out in the hallway. There is a desk out there and you work in the hallway. It’s 
my decision when I let you back in the class.” 

In contrast, major disciplinary actions were not observed in Mr. K’s clas- 
ses. On occasion students were sent to the sideline to “cool off” during a 
physical education class, but then were incorporated into the group’s ac- 
tivities a short while later. When classroom noise reached a level considered 
to be unacceptable, a small handbell was rung, students stopped their activity 
on this cue, and Mr. K would calmly discuss the nature of his concern with 
them. Routines and expectations had been set at the beginning of the year 
and were in evidence until the end of the year. 

In relating to her students, Ms. M expressed concern with the need to 
model appropriate behavior so that “hopefully they [the students] would set 
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some standards for themselves.” A recurring theme—concern for the stu- 
dents’ well-being—was evident in daily interaction with individual students 
and in comments: “I see them as people. I treat them as people. I keep telling 
them, I’m going to treat you like people and you have to treat each other like 
people.” Her words and actions reflected the primary aim of the “one-caring” 
who Noddings (1984) describes as one who strives “to preserve and enhance 
caring in herself and in those with whom she comes in contact” (p. 172). In 
turn, one aspect of Mr. K’s philosophy had to do with his belief that teachers 
need to “know” their students in order to become (or be) effective teachers. 
He became better acquainted with his students, along academic and social 
dimensions, by encouraging them to talk to him, being interested in their 
interests, and providing written and oral feedback to students in their daily 
journals. In his actions and comments he indicated that he believed “that 
working from their [students’] world” was where a teacher needed to start. 
He continued to feel a great sense of responsibility for his students because 
“they’re still dependent on adults and not quite as independent as I some- 
times expect.” 

As suggested earlier in the expression of images, both in practice and 
words, a moral dimension was apparent. Images were not neutral; they 
suggested a better or worse situation. In part, the moral dimension emerged 
from the experiences where it had its origin and in the person’s judgment of 
the experience. In the case of Ms. M, the incident of cheating in her classroom 
presented her with several moral dilemmas: how to address the student, how 
to inform the parents, how to allow the student to make amends without 
being labeled as “cheat” by the other students, how to help the student face 
the consequences without a major psychological hurt. In reflecting on the 
incident, Ms. M continued: “It was difficult to face this student the next day. 
The other kids, though, didn’t treat her any differently. She hadn’t become 
withdrawn or depressed. We tried to handle it internally ... we didn’t get her 
on the defensive ... let’s hope it doesn’t happen again.” 

Besides discussing the issue of cheating with the student, Ms. M provided 
a model. To support her moral ideals, she needed to show herself to be caring 
and ethical. Hence she was not content to simply enforce rules. As she 
confronted the student, she began by attributing the best possible, rather 
than worst, motives to the student and then proceeded to explain to the 
student “that I cannot allow you to cheat because it isn’t fair and right to do 
so.” Ms. M did not need to resort to punishment because the rules were not 
sacred to her. What mattered the most was the student and how the student 
would in the future approach the moral problem of cheating as a result of the 
interaction with her. Ms. M expressed the hope that the student would learn 
to be ethical in her statement, “let’s hope it doesn’t happen again.” 

Mr. K’s commitment to the need to question authority found him facing 
the moral dilemma as to how far this activity might be taken in the lives of 
his young students. His comments reflected the perplexing situation: “I like 
to encourage them [the students] to think of questions, but there are times 
when you just want them to do it because you have only so much time.” In his 
quest to help students learn to question authority, he wanted the actions to 
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be carried out within acceptable parameters in a “subtle sense” in which 
students “know their roles” yet speak out when “something seems wrong.” 
He was trying to sort out the dilemma he faced between creating a “de- 
mocratic-style” class in which he went with the “flow” and “allowed freedom 
... for the [students] to explore on their own” and the need to meet expected 
achievement levels for each grade and the testing requirements. 

Mr. K’s moral dilemma relates closely to Noddings’ (1984) concept of 
moral education in which a commitment to caring as the ethical ideal “invol- 
ves a duty to promote skepticism and noninstitutional affiliation” (p. 103). 
He seems to recognize that in a deep sense no nation can be truly moral. A 
nation can only capture in general terms what caring, moral individuals 
would like to have happen in well-described situations. Laws, manifestos, and 
proclamations are limited and they may support immoral as well as moral 
actions. Only the individual can be called to moral behavior. Mr. K’s attempts 
to have his students question authority fits into the moral dimension of his 
image of community. 

As well, an emotional dimension of the image of community was apparent 
in the oral expressions and in the expression of actions in the classroom. As 
part of her approach to developing a supportive classroom, Ms. M set be- 
havior standards for her students. Students were not allowed to tell each 
other to shut up. Students were expected to defend one another and demand 
respect from each other. Disruptive students were banished from the class- 
room and became nobody until they decided to change their behavior. 

In speaking about his democratic classroom, Mr. K made much use of the 
words fair, equal, and responsibility. He wished to create fair classroom in 
which “everyone has an even break in what they are trying to do ... everyone 
gets equal attention” and is “given their chance.” At the end of the year, his 
image of a democratic community became more apparent in his comments 
during the final interview (June 17, 1987): “I’d like to set up a democratic 
classroom where the onus and responsibility is on them. They [the students] 
make the rules and hopefully [I will be able] to use the teachable moment and 
direct peer pressure appropriately.” 

Finally, the origin of the image of community influences each intern’s 
personal and professional expressions in talk and practice. It is beyond the 
scope of this article to speak in depth about the origins of each intern’s image 
of community. It is, however, necessary to acknowledge that the source of 
each intern’s concept of community would be different, because the image 
acts as the coalescence to combine and form a whole of an individual’s 
experiences. No two individuals experience the world in the same way. 

In the case of Ms. M, her image of community emerged from her religious 
roots and created a congruence between her professional and personal lives. 
In her classes there existed a sense of “being in” or “being out.” As long as the 
code of behavior and classroom norms were followed, a student was “in,” but 
willful or unintentional neglect of these standards of behavior or a perceived 
lack of commitment led to banishment until she decided the individual might 
return to the group. In class discussions issues were occasionally referred to 
a higher authority for resolution. This authority was that vested in the school 
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philosophy as it was expressed in religion classes, school rules, the values held 
by parents in the larger community, and in the belief system of the intern 
herself. 

As indicated earlier, Mr. K was a social studies major with background 
preparation in political science, history, and related social issues. In his 
classroom he consistently encouraged his students to “think about the way 
things are and the way they could or should be.” Social critique, although 
expressed in a limited way, appears to be a way of life for Mr. K. His 
beginning attempts at encouraging his students to question authority and his 
striving for more equality in the life of his classrooms are forms of social 
criticism. A social studies educator is encouraged to teach students to ques- 
tion the status quo in society, because the principle of becoming a responsible 
citizen includes the civic competence of critical questioning of authority. 
Competent citizenship implies the concept of citizens as social critics (van 
Manen, 1980). 

This intern also had a history of involvement in politics at the provincial 
and local levels. The political roots of the image of community gave rise to the 
concepts of equality, involvement, cooperation, fairness, and sharing. For 
Mr. K a democratic orientation functioned at tacit and explicit levels of 
awareness as he talked about his role in the teachers’ organizations and his 
elaboration of plans to create a positive atmosphere in his classroom. I can 
only speculate that Mr. K’s image of community may have emerged from his 
training as a social studies educator and his involvement in provincial and 
local politics. 


Conclusion 
The development of personal practical knowledge as expressed in the first- 
year teaching situation as experienced by two interns in the Initiation to 
Teaching Project is the focus of this study. The work drew heavily on Elbaz’s 
(1981) concept of practical knowledge, Clandinin’s (1985) concept of image, 
and Connelly and Clandinin’s (1985, 1986) concept of personal practical 
knowledge. 

The concept of personal practical knowledge embodies a dialectical view 
of theory and practice. Theory and practice are viewed as inseparable; prac- 
tice is seen as theory in action and theories are informed by the conditions 
existing in the classrooms. This personal practical knowledge is centrally 
connected in what Clandinin (1985) terms image. Images act as the coales- 
cence of experiences and link a person’s educational professional life and his 
or her personal private life. 

In this study of the development of personal practical knowledge in the 
first year of teaching, the image of classroom as community became the 
nucleus around which significant elements of past experiences were at- 
tracted and shaped in the classroom lives of two interns. The conceptualiza- 
tion of images developed in this study contributes to an understanding of the 
nature of personal practical knowledge that differs from that usually con- 
sidered to be knowledge. This personal practical knowledge is an emotional 
and moral knowledge; a personal and professional knowledge; a past and 
present knowledge; and, embodied in this study, in the image of classroom as 
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community. The image is expressed differently in classroom practices and in 
the lives of the students in the two classrooms. 

What this study suggests is not only that images function in the lives of 
classroom teachers, but that the existence of an image such as classroom as 
community can be played out differently in classroom practices and teacher 
talk. In this study, the image of community as held by two interns was 
expressed differently in how the classroom was organized, how students were 
encouraged to relate to one another, how discipline measures were exercised, 
the values each intern emphasized in the classroom, and how the interns 
related to the students themselves. 


Note 


1. This work was supported by a research grant from the Alberta Advisory Committee for 
Educational Studies 
2. Pseudonyms are used throughout to protect the anonymity of the participants. 


References 


Clandinin, D.J. (1983). A conceptualization of image as a component of teacher personal 
practical knowledge. Doctoral dissertation, University of Toronto. 

Clandinin, D.J. (1985). Personal practical knowledge: A study of teachers’ classroom images. 
Curriculum Inquiry, 15, 361-385. 

Clandinin, D.J. (1986). Classroom practice: Teacher images in action. London: Falmer. 

Clark, C.M., & Peterson, P.L. (1986). Teachers’ thought processes. In M.C. Wittrock (Ed.), 
Handbook of research on teaching (pp. 255-296). New York: Macmillan. 

Clark, C.M., & Yinger, R.J. (1977). Research on teacher thinking. Curriculum Inquiry, 7, 
279-394. 

Clark, C.M., & Yinger, R.J. (1979). Teachers’ thinking. In P.L. Peterson & H.J. Walberg 
(Eds.), Research on teaching (pp. 231-263). Berkeley, CA: McCutchan. 

Connelly, F.M., & Clandinin, D.J. (1985). Personal practical knowledge and the modes of 
knowing: Relevance for teaching and learning. In E. Eisner (Ed.), Learning and teaching 
the ways of knowing, NSSE Yearbook 84(2) (pp. 174-198). Chicago: University of Chicago 
Press. 

Connelly, F.M., & Clandinin, D.J. (1986). On narrative method, personal philosophy and 
narrative unities in the study of teaching. Journal of Research in Science Teaching, 
23(3), 15-32. 

Elbaz, F. (1981). The teacher’s practical knowledge: Report of a case study. Curriculum 
Inquiry, 11, 43-71. 

Janesick, V. (1982). Of snakes and circles: Making sense of classroom group processes 
through a case study. Curriculum Inquiry, 12, 161-190. 

McKeon, R. (1952). Philosophy and ethics. Ethics, 61(2), 79-100. 

Nespor, J. (1986). Academic tasks in a high school English class. Curriculum Inquiry, 17, 
203-228. 

Noddings, N. (1984). Caring: A feminine approach to ethics and moral education. Berkeley, 
CA: University of California Press. 

Peterson, P.L., & Clark, C.M. (1978). Teachers’ reports of their cognitive processes during 
teaching. American Educational Research Journal, 15, 555-565. 

Schon, D.A. (1983). The reflective practitioner: How professionals think in action. New York: 
Basic Books. 

van Manen, M. (1980). A concept of social critique. History and Social Science Teacher, 15(2), 
110-114. 


113 


pheabaracigoah a He , 
aueun ne i ri, ee ‘ 


a is ‘jane fet fee PY sel 
eee ere nathan Gaunt 
De ne Oe a ee i 
) ait, re Paps te: atte ee ear ree 
- a io ad bres a ¥ 
pian yay" 


a ot! ah na a a tt “eye PD a 
Sasi ah cceeceueienoe 0) AR, pe ns oe ee ee 
: an — , ee ’ iL . 


- A, Secale 4 oF ay ae 
hesabietaie 
ee ee 


pee re ope 9 dong 
ere “Pee eee oki ve 


ae ® nip ar Tae bling 
Po aay ee Peat Te a org sacle Ueratastl ital 


4 an Loom @ 
rr 4! L. Fie 


a re . tarn 


Sth te lee Pn oe y 
alas ay Silay ty el ye a 
; : adel pe. tee. aree* Wy 
Ont PE adie 
rw anh’ ery 


. een | Path pihayeaas 
fol) ee ee tee ae ah, 
ete RRS Fes 
: vipa! yy ate os 


ee ee te 
ayer Al GN rye ee Pane 
( Le: Ae Ws eee sa es re ; 
wy atl ob at arn yal WA. Aus | Meats Vo a 
whet BO & nie rae _ ead | oS 


.3 a may. 
COR rs 


eae bb Te eecsstb sara sustiae sane ro, 


a 
i : 

. pmcgeer eee ese FO ete 

Sargaas As 04 Aggy Phe a ae ny 

} sone ge Te 

5 oe nee he A cay Papr-haeers To ~~ 
~?* Pe Cen ne p= heird — 
> ae i a ial» ote 

SAT OL Maras ¢ r% ny ne: Coast ris : as 


GU 
may Sarthe Array o rev jy piles 
é 


The Alberta Journal of Educational Research Vol. XXXVI, No. 2, June 1990, 115-132 


MICHAEL G. WODLINGER 
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April: A Case Study in the Use of 
Guided Reflection 


This article reports the findings from that part of a recently completed study which 
focused on the use of critical incident analysis and guided reflection with preser- 
vice teachers in a one-year after-degree BEd program. The case study approach 
has been employed to demonstrate how, in one instance, critical incident analysis 
has been used to assist one preservice teacher to derive meaning from her experi- 
ences in the teaching practicum. The results of this case study analysis support the 
literature that contends the use of guided reflective journal writing is a useful 
process in assisting the development of “teacher.” The author further suggests 
that there is need for preservice teachers to develop professional relationships with 
“metaguides,” or mentors, who are familiar with the techniques involved with 
guided reflection. Such relationships more readily enable preservice teachers to 
derive meaning from experiences in classrooms and assist in the integration of 
learnings. 


Introduction 

Throughout the history of teacher education, discussion has focused on the 
impact of the student teaching practicum on the developing perspective of 
“teacher.” The vast body of literature on the development of the teacher 
appears to be consistently of the opinion that the teacher preparation process 
is a period of growth from the perspective of student toward the perspective 
of teacher (see, for example, Davis, 1981; Doyle, 1979; Edgar & Warren, 
1969; Friedman, 1982; Goodlad, 1982; Hart & Adams, 1986; Reeves & 
Kazelskis, 1985; Tabachnick, Zeichner, Densmore, & Hudak, 1983; Waller, 
1932). This literature further suggests that the practicum or practice teach- 
ing experience is highly influential in this growth process (Caruso, 1977; 
Egan, 1982; Friedman, 1982; McIntyre, 1983; Parkay, 1980; Zeichner, 
1980). 

Supporters of this notion of the importance of the teaching practicum 
take the position that these experiences have an impact on the attitudes, 
values, and beliefs of preservice teachers (Wodlinger, 1985). Hoy and Rees 
(1977) claim that, with continued exposure to the reality of the classroom, 
the becoming teacher’s attitudes shift toward those of the associate teacher. 
If, for example, the associate teacher assumes an authoritative stance (Hoy 
& Rees, 1977), then the becoming teacher shifts to a greater custodial 
attitude toward the management of student behavior. Moreover, when the 
practicum is highly oriented toward evaluation, as the typical practicum 
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appears to be, the preservice teacher’s attitudes tend to be emulations of the 
evaluator’s perspectives (Edgar & Warren, 1969). If such is the case, then 
this is strong evidence for ensuring that preservice teachers consciously 
obtain meaning from practicum experiences and events. 

In his study of the influences of field experiences on the attitudes and 
behaviors of student teachers, McIntyre (1983) recognizes the emotional 
impact of in-school experiences and concludes that “it is important to under- 
stand how the people and experience itself affect the preservice teacher ” (p. 
4). 

To continue this argument, Zeichner (1980) presents a position that the 
preservice teacher is not a tabula rasa, readily accepting the wisdom and 
knowledge of significant others, but a participant who takes an active role in 
professional socialization. The developing teacher does exert some control 
over professional socialization, “even if only to a limited degree ” (p. 33), and 
“it is still the student teacher who largely controls the activities of super- 
vision” (p. 35). Such control begs the deritualizing of the teaching practicum 
in order that the becoming teacher may understand the meanings that 
emerge from the experiences in the classroom and grow with that under- 
standing (Zeichner, 1980, p. 35). This development of meaning is most 
crucial in acquiring control over one’s learning and in identifying one’s 
professional stance. 


Reflection and the Preservice Teacher 

The process of reflection is a deliberate, purposeful activity that allows an 
individual to contemplate behaviors and events and reactions to them. More 
than that, reflection is a natural process that facilitates the development of 
future action from the contemplation of past and/or current behavior (Sch6n, 
1983). The reflective process assists student teachers in deriving meaning 
from their experiences. Nystrand and Wiederspiel (1977) claim that through 
the reflective process, especially when one is writing about matters of per- 
sonal importance, “the writer is able to break through to new constructions 
and new meanings for experience” (p. 105). 

Schén (1988) speaks to the role of “reflective transformation” in attend- 
ing to and drawing from experiences in the classroom. Through the use of 
reflective “stories” the developing teacher is able to attend to the derived 
meanings of classroom-based experiences and the underlying assumptions 
and beliefs that serve as the framing mechanisms for those experiences. 

The need for preservice teachers to be engaged in reflective activity is 
apparent when one realizes that each enters the preparation program with a 
well-developed set of beliefs and perspectives about teaching and teachers, 
learning and learners. Evidence exists for the claim that the associate teach- 
er has a major influence over changes to these perspectives during the 
practicum process. In his review of the literature, Zeichner (1980) discovered 
a plethora of studies pointing to preservice teacher attitude and behavior 
shifts toward those of the associate teacher and some evidence that such 


influence extends into the induction period, the first year or so of the inser- 
vice career. 
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Given that this influence does exist, it is imperative that the preservice 
teacher explore experiences in the practicum and derive personal meaning 
from those experiences. This assertion is even more valid when viewed from 
the perspective that the preservice teacher is developing mental schema of 
the roles of teacher and learner in the context of the classroom. These 
schema, I speculate, develop from the preservice teacher’s principles and 
beliefs about these and other aspects of the schooling process, gained through 
experiences as student and through experiences in theory classes in the 
faculty and in the practicum classrooms. Given the influence of these experi- 
ences, there is a need for the preservice teacher to actively and consciously 
consider them and examine their import. 


The Role of the Metaguide 

The role significant others play in this process is crucial. These may be peers, 
associate teachers, and/or faculty supervisors. In their discussion of learning 
metaphors, Forsythe and Haughey (n.d.) introduce the term “metaguide” or 
the human agent in the learning process who functions as the “universal 
traveller, who can give feed-back progress, monitor the domain of the journey 
and assist in goal setting, goal maintaining and the innovation of new goals 
... to assist the student in the journey through the domain” (p. 3). 

In the context of growth of becoming teachers, the metaguide focuses on 
the “reflective moment,” the juxtaposition of action and thinking about 
action. The faculty advisor or associate teacher might function as such a 
metaguide. 

Participating in this process of exploration of one’s experiences and 
reflection about those experiences allows the faculty advisor and associate 
teacher the opportunity to glimpse the world through another’s eyes 
(Nystrand & Wiederspiel, 1977). This is a social relationship, as Kelly (1955) 
noted: “To the extent that one person construes the construction process of 
another he may play a role in social process involving the other person” (p. 
618). 

My position is that the role of the metaguide, whether faculty advisor or 
associate teacher, is to facilitate the preservice teacher’s progress toward an 
understanding of the meaning of experiences and of his or her reaction to 
those experiences, in order that the formation of schemata, or frameworks of 
understanding, may be deliberate and rational rather than nonrational and 
accidental. My perception of the metaguide’s role corresponds with van 
Manen’s (1977) in that it is to assist the preservice teacher in working 
through the processes of “analyzing, clarifying individual and cultural expe- 
riences, meanings, perceptions, assumptions, prejudgments, and presupposi- 
tions” (p. 226). | 

The becoming teacher needs assistance in the interpretation and mod- 
ification of responses to the realities of the classroom and school in order to 
explore experience therein and to derive personal meaning. Such assistance, 
through a combination of coaching and mentoring interventions, attempts to 
lead the beginning teacher out of the morass of the immediate. The goal of 
the intervention is to help the teacher develop the more global perspective of 
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an active player in the developing awareness of the realities of the training 
experiences that affect his or her professional and personal life. 


Reflection and Journal Writing 


Nystrand (1977) asserts that language plays “a key role in altering our 
consciousness,” especially when entering presumably unfamiliar territory as 
does the preservice teacher when entering the practicum. The use of lan- 
guage through the medium of journal writing is especially valuable in de- 
veloping new awareness about experience and in constructing new meaning 
from experiences. As Nystrand (1977) writes: “language, like intelligence, is 
an instrument of adaptation.... language plays an instrumental role in facil- 
itating the individual’s entry into new.experience. It does this by serving to 
construct and maintain new meanings” (p. 99). 


Critical Incident Reflection 


Critical incident reflection has been used extensively in situations requiring 
medical treatment (Richter, McGrath, Humphreys, & Goodman, 1986; Sal- 
kovskis, Jones, & Clark, 1986; Sosin, 1983) and in educational problem 
solving (Berrill, 1986; Heinze, 1987; Stones, 1983). It provides that “personal 
importance” moment for the student teacher. This process involves the 
selection of an event or series of events that are of particular significance to 
the writer and analyzing the experiences and reactions to them. Its use has 
been shown to be appropriate when focusing on developing learning from 
one’s experiences in a specific context (D’Arcy, 1987). For example, critical 
incident reflection is particularly valuable when the preservice teacher re- 
flects on instructional and managerial situations and attempts to understand 
the flow of events and, more particularly, when developing alternative objec- 
tives and strategies. 

In summation, many influences operate in the practicum classroom that 
affect the growth of the preservice teacher. To enhance and direct the 
growth, to ensure that it is not haphazard or ad hoc, there is need for a 
metaguide, a significant other, associate teacher or faculty advisor. Such an 
individual will assist the becoming teacher in reflecting on these influences 
and in reflecting on the process of reacting to them. 

Guided reflection is an important component of this process. Such guided 
reflection empowers the preservice teacher to assume more direct control 
over the growth that he or she experiences as a result of being in the 
practicum classroom. 


The Case Study 
April participated in a larger project, the details of which are reported in 
Wodlinger (1988). She is a 24-year-old preservice teacher enrolled in a one- 
year, after-degree BEd program in a faculty of education that enrolls ap- 
proximately 215 students annually. 

Over the course of her training, April participated in 10 weeks (50 days) 
of teaching practica. These were divided into two-week blocks, with the 
exception of weeks one and two. The first experience, at the beginning of 
October, was an observation period of one day per week for a total of five 


118 


The Use of Guided Reflection 


days. The second week was an ungraded practice teaching session. The 
remaining eight weeks of practice teaching were graded by both a faculty 
advisor and the associate teachers. Each two-week period of evaluated teach- 
ing was with a different associate teacher, a different school, and a different 
faculty supervisor. 

April was selected as the subject for this case study primarily because of 
her interest in reflective journal writing. As a child she had been interested in 
diary writing but had done little from her teenage years until the present. 
Her enthusiasm entering the project was at a high level and was maintained 
throughout the larger project. In this project, I functioned as April’s advisor 
and metaguide. April submitted a critical incident reflection after each week 
of the practicum. 


The Critical Incident Reflection Journal 


The critical incident reflective writing required April and her colleagues to 

focus on an incident of personal importance occurring during the practice 

teaching session. In her writing, April divided her entry into the following 
segments: 

1. Context. A description of the practicum setting, and in particular the 
context of the incident. 

2. Incident Description. This segment involved relating the events of the 
incident. In particular, April was asked to focus on antecedents to the in- 
cident, behaviors of the actors involved, her behaviors during this time 
period, and her emotional reactions to those behaviors. 

3. Analysis. In the analysis segment of the reflective writing, April was 
asked to focus on the antecedent events and behaviors and her be- 
haviors, in order to determine if any connections existed. As well, she 
focused on the rationale for her behaviors and explored her emotional 
reactions. 

4, Learnings. April was asked to discuss personal learnings that arose 
from her analysis of the incident, the behaviors that occurred, and her 
reactions to those behaviors. This segment of the writing was designed 
as a journal mirror through which April could develop and consider alter- 
native courses of actions in the light of her objectives. 


Guided Reflection 


After each pair of journal entries was reviewed, April and I met, in an 
informal format, to discuss her experiences and to share the meaning that 
she gained through reflection. During these individual sessions, also held 
with other participants in the larger study (Wodlinger, 1988), I attempted to 
help April to (a) explore her experiences and (b) develop insights that focused 
on her role as an emerging teacher. In a real sense, I functioned as a 
metaguide, discussed by Forsythe and Haughey (n.d.), assisting in April’s 
development of schema for being a teacher. 

Our interviews were audiotaped and the resultant transcripts used as 
data for subsequent interviews. In this way, we had a natural linkage be- 
tween interviews, providing physical as well as conceptual continuity. 
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It was vitally important that, during our interviews and through my 
written responses, April focused entirely on her derived meanings from 
incidents and reactions. To have forced her to accept my meanings would 
have been dysfunctional. Thus I was careful to avoid making value judg- 
ments about April’s behavior and that of either the associate teacher or the 
students. This perspective did not stop me from displaying empathy with 
April’s struggles and anxieties. This support was necessary for her to feel 
comfortable interacting with me and be open in her discussions and analyses. 

In fulfilling my role as metaguide, then, I concentrated on asking ques- 
tions rather than providing direction, as I wanted April to discover her own 
meanings and learnings. This approach proved to be difficult at times, as I 
found myself mentally making judgments and taking positions and at times 
wanting to share these and indeed to impose these on April. I deliberately 
took firm control of my reactions and consciously became aware of my 
behavior when interacting with April. 


Findings and Discussion 
Of the several distinct themes that emerged from the analysis of April’s 
journal writings, the theme of control, focusing on her perceptions of the 
teacher as controller and dealing with issues of authority and obedience, was 
most pervasive. I have focused on this theme because, as may be seen below, 
April’s images of both the student and the teacher revolved around her 
perceptions of authority and control. 

The first of April’s journal entries centers on a critical management 
incident with a male student in a grade 7 music class. Her entry is in three 
parts. The first part, a description of the episode, provides the reader with 
some insight into principles held in the concept of teacher as controller. At 
the end of this entry I have provided my written response. This provides an 
example of the communication process in which April and I engaged outside 
the informal interview. 


November 30, 1987 
Episode 


Before we left the regular classroom the teacher told the students that they 
were to sit in a formation of boy-girl-boy, etc. I didn’t understand the reason- 
ing behind making the students sit in this manner but didn’t feel it was my 
place to question the teacher. This was the first time I had been given the 
responsibility of taking the students to music class by myself. Music was 
taught by a parent volunteer so the children needed quite a bit of supervision 
during this period. 


Implicit in April’s concept of the teacher as controller is the principle that 
the teacher’s policies are not to be questioned. The teacher is the controller of 
student behavior. Associated with this principle is the perception of herself as 
student, as evidenced by the comment: 

“I didn’t understand the reasoning behind making the students sit in this 
manner but didn’t feel it was my place to question the teacher.” The follow- 
ing statement confirms her perception of herself as student, through the 
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intimation that she is being tested much as a student might be tested. “This 
was the first time ... so the children needed quite a bit of supervision during 
this period.” 

In the episode description there is another principle dealing with the 
controlling theme. 


When we arrived at music, all of the students sat as they had been asked 
except for two boys. One of them moved when I asked him to but the other 
boy, P, refused. I was at a total loss as to how to deal with the situation and as 
a result he and I ended in a battle of wills. I told him in no uncertain terms 
that he had to move, but he continued to refuse until another boy in the class 
told him not to be “stupid” and to move when I asked him. He finally moved 
to the appropriate spot but by this point I felt frustrated, humiliated, and 
really angry with myself. I knew that I had dealt with the situation badly but 
I didn’t know what I could have done to make it better. 


April sees herself as being in competition with her student for control of 
the classroom: “I was at a total loss as to how to deal with the situation and 
as a result he and I ended up in a battle of wills.” At this point another 
principle dealing with control emerges from the writing—the teacher has the 
sole responsibility for control of the classroom—“another boy ... told him ... to 
move when I asked him.” Here April discusses her feelings of frustration, 
humiliation, and anger, mainly directed toward herself. I would speculate 
that these feelings are indications of a perceived lack of congruence between 
what she proposes to happen, according to her overriding concept of the 
teacher as controller, and what actually does occur. To this point in her 
interactions with students two principles have been violated: The teacher’s 
policies are not to be questioned, and the teacher has sole responsibility for 
control of the classroom. 

In the analysis section of her entry, April explores the rationale for her 
reactions. In doing so, she uncovers yet another principle dealing with con- 
trol. 


Analysis 


Looking back on the incident, I am beginning to recognize some of the feelings 
which prompted my reaction to P. When he refused to do as I had told him to, 
I felt that my authority was being threatened and as a result I became 
defensive. I felt that if I gave in to his refusal, it would be a sign of weakness 
resulting in a loss of power with the entire class. In that split second of his 
refusal, my back went up and I thought to myself, “I am going to make this 
kid do as I say, even if it kills me!” As much as I think that my emotional 
reaction was probably a very normal one, I don’t think that the way I dealt 
with or expressed my emotions was very productive, either for myself, or for 
the student. 


The implicit principle that April uncovers here states that the teacher 
must not retreat from policy if students refuse to follow that policy: “I felt 
that if I gave in to his refusal, it would be a sign of weakness resulting in a 
loss of power with the entire class.” 
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Once more the focus on control “at any cost” comes through: “In that split 
second of his refusal, my back went up and I thought to myself, ‘Iam going 
to make this kid do as I say, even if it kills me!’”” This reflection-in-action is 
interesting in that there are no alternatives generated. Rather, what is 
displayed is her concern that she was receiving a direct challenge, “my 
authority was being threatened.” 

In the final section of this entry, Learnings, April attempts to understand 
the reasons for her reaction. In doing so, she reaches a critical point in her 
thinking about control. 


Learnings 


I think that when P refused my request to change seats, I should have asked 
him aside so as not to disturb the rest of the class, and then sat and talked to 
him for a while. I could ask him why he didn’t want to sit between two girls 
and shown respect for his feelings by listening and responding to these. After 
listening to P, I could then explain to him the reasoning behind this seating 
formation. The problem here was that I didn’t know why it was important for 
the students to sit in the formation in which they had been asked. I didn’t 
even agree with the teacher’s request because I knew how uncomfortable it 
made a lot of the students to sit beside members of the opposite sex. It was 
therefore difficult for me to explain the logic of this seating arrangement to P. 


It is in her discussion of learnings that April takes tentative first steps in 
“journal thinking” about being teacher. Reflecting on her behavior, she 
generates alternatives that keep in mind the students’ interests and needs 
rather than her own. As well, she begins to question the logic of another 
teacher’s policy. Although she does not make the break from her held position 
regarding teacher authority, she is becoming more aware of the incongruity 
between her implicit principles of teacher control and the realities of student 
behavior. 

The following is an excerpt of my written response to April when she 
submitted her journal to me: 


I too find the teacher’s request to be unusual. What possible motive do you 
think she had? What do you think might have happened, had you asked the 
teacher for her rationale for this seating arrangement? ... I can understand 
your feelings of anxiety and vulnerability, during this episode. You were faced 
with a direct attack on your authority as the teacher. I agree that your 
reaction to his behavior was a normal one for you, given your current state of 
preparation as a teacher and your feelings of anxiety being in charge of a 
classroom so early in your training. How do you think P felt, when he found 
himself in this confrontation with you? Upon reflection of your interaction 
with P, you have developed a very clear alternative to the strategy you did 
use. The key to your behavior, I believe, lies in your comment, “The problem 
here was that I didn’t know why it was important for the students to sit in the 
formation in which they had been asked.” That degree of confusion had a 
great influence on your responding to P. How might you have reduced that 
degree of uncertainty, before you were faced with the confrontation? 
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In our subsequent interview, April and I focused on the two questions I 
posed to her and issues that she brought to the surface. The intent of my 
questions was to assist April in dealing with: 

1. developing a sense of other in her analysis of classroom behavior, and 

2. anticipating events that occur in the classroom, so as to avoid unneces- 
sary confrontations. 

Our interview lasted approximately two hours. 

April’s focus on control continues in her January 19 journal entry. The 
critical incident dealt with at this time centers around another male student 
whose lack of response to April’s attempts to stimulate his activity causes 
some difficulty. April sets the scene: 


January 19, 1988 
Episode 


I had introduced a new unit and story in language arts to the students the day 
before K arrived. We had read the story aloud and each of the students had 
drawn a picture of some sort of “flying machine,” as our unit was on airflight. 
In the class for which K arrived, the students were answering questions 
related to the story and writing a new ending to it. K sat down in his seat and 
when I explained to him what we were doing, he simply stared at his closed 
book and didn’t respond. I wasn’t sure how much of his reaction was due to 
lack of understanding (English is a second language for these students) and 
how much was simply due to a lack of interest. I left him to read the story to 
himself and he slowly leafed through the pages for a half hour. When I 
returned and asked him if he had understood the story, he looked up at me 
but didn’t respond. I ended up sitting with him and explaining the entire 
story aloud and then I asked him to at least try to draw a picture as the others 
had done. After some hesitation, he eventually drew a nice picture of a 
helicopter. 


April is beginning to grapple with differences in learning abilities among 
students. It is evident from her reaction to K’s lack of response that the 
student’s behavior does not fit her perception of how students ought to learn. 
Her statement, “I wasn’t sure how much of his reaction was due to a lack of 
understanding ... and how much was simply due to a lack of interest,” 
indicates that she senses something is wrong but is unable to act on this 
knowledge. She does engage, though, in some speculation toward an under- 
standing. 


Analysis 


I felt really frustrated in this situation as I didn’t seem to know how to get 
through to K. I feel as if I not only short-changed K, but I short-changed the 
rest of the class also by making his work easier than theirs. Although K’s lack 
of response didn’t anger me, it frightened me because I had no idea why he 
was acting in such a manner. I think the reason I sat down and told him that 
whole story was because I had no idea how to help him with his reading 
(which was obviously poor). His unresponsiveness also made me feel insecure. 
The thought crossed my mind that maybe he was only not responding so as to 
see how uncomfortable he could make me feel, and to what lengths I would go 
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to get something out of him. If I was being tested, I didn’t deal with it very 
well. 


April’s analysis of the episode appears to be a critical discovery point. Her 
statements, “it frightened me because I had no idea why he was acting in 
such a manner” and “I had no idea how to help him with his reading” 
confirm, for April, that she lacks knowledge in dealing with students who 
learn differently. Here she is facing a child who does not fit her image of the 
learner—passive and compliant—and is afraid. K’s behavior appears to force 
April into dealing with herself in her image of the teacher as the “expert” and 
“in control” and she is found wanting. She then falls back on the concept of 
teacher as controller, which leads to the speculation that she is interpreting 
all perceived “inappropriate” behavior in the context of control. 

Another interesting observation that arises from her writing is the sug- 
gestion that April has a picture of student dynamics that contains an element 
of competition. Her comment, “I short-changed the rest of the class also by 
making his work easier than theirs,” indicates a perception of students as in 
competition with each other for gain. This perception does fit into the control 
perspective. 


Learnings 


Even in hindsight I am not sure what I could have done to make this a more 
productive situation for both K and myself. Perhaps I would have been better 
to explain to K that I was confused as to why he wasn’t responding. Because 
his silence was making me uncomfortable it felt like he controlled the situa- 
tion. I could have told him that when he was ready to talk he could approach 
me, and in the meantime just told him the work that he was supposed to do 
(to catch up with the rest of the class). 


April’s learnings indicate an opening up of herself, to allowing herself to 
be more vulnerable: “Perhaps I would have been better to explain to K that I 
was confused as to why he wasn’t responding.” Is she preparing herself to 
take a risk? She certainly is exploring the concept of ownership and respon- 
sibility: “I could have told him that when he was ready to talk he could 
approach me.” 

The control theme continues in the journal entry of February 19, 1988. It 
is in this writing that April recognizes a need to focus on being in control and 
actively seeks help. 


February 19, 1988 
Episode 


One of the students was answering a question and I was prompting him to dig 
a little deeper and come up with more for his answer. D, who is a rather large 
child for his age and has a belligerent attitude toward me and the school, 
interrupted the exchange and “grunted” that there was no more to the 
answer and I should leave the other students alone. I was so taken aback that 
at first I didn’t know what to do. After a short pause, I just looked at D and in 
a sarcastic tone said, “Thanks for telling me how to do my job!” He gave me a 
dirty look, and the discussion continued as before. 
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Once again April’s image of the learner is tested. As yet, evidenced over 
the course of several months’ writing, she is not prepared for a deviation 
from her perceptions of either the student, “I was so taken aback that at first 
I didn’t know what to do” or the teacher, “Thanks for telling me how to do 
my job.” However, during this episode, April reaches a crisis point. 


Analysis 


Boy—I really didn’t want to write about this episode! I was so angry that I 
reacted the way I did that I was determined to try and forget about it. But as 
the days went by and it kept bothering me, I figured that the only way to 
resolve the issue was to write about it and maybe get some additional insight 
from your feedback. I am so angry that I allow myself to feel so threatened by 
students! (something similar to this happened during my first practice teach- 
ing session). Am I so insecure that even a twelve-year-old child can make me 
feel out of control? Oh no—there’s that word CONTROL! I struggle so much 
with my need to feel “in control”—-maybe that’s what is at the base of this 
whole incident. I desperately needed to feel like I was not only in control of 
myself but also in control of the whole group of students around the table. D’s 
comment made me feel as if I was losing control on both counts. I felt as if I 
was being pushed off a very precarious balance and I therefore felt vul- 
nerable. I rose to his challenge of my authority by becoming defensive. Maybe 
if [had more self-confidence I would be comfortable sharing a certain amount 
of control/responsibility with the students, and wouldn’t need to feel in total 
control all of the time. (It takes a lot of energy to be in control all the time.) 


In dealing with D, April places herself in a power struggle. D’s behavior 
was challenging indeed but not confrontational. April’s response, though, 
was an overreaction. It is this reaction to D’s “grunted” comment that forces 
April finally to focus on control as an issue. 

In the analysis of the incident, April confronts her image of the teacher as 
controller and authority figure: “I desperately needed to feel like I was not 
only in control of myself but also in control of the whole group of students 
around the table.” This realization forces April to deal with her need for 
control—“I struggle so much with my need to feel ‘in control,’””—and to focus 
on dealing with these feelings—“Maybe if I had more self-confidence I would 
be comfortable sharing a certain amount of control/responsibility with the 
students.” 


Learnings 


I would have liked to have responded to D by asking him why he thought that 
there was no more to the answer than had already been given. This reaction 
would have ended any power struggle before it even began. I’m not sure how 
best to accomplish a greater sense of self-confidence so as to preclude this type 
of incident from happening again, but at least now I understand where my 
reaction was coming from. This process of writing about it has certainly 
helped me feel more peaceful about my situation. It always amazes me how 
writing can be so therapeutic! Please share with me any insights you may 
have about needing to feel “in control” in the classroom. I am really at a loss 
about whether I should be striving for control, or striving to be able to let go 
of this need. 
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April’s assessment of what she learned from this experience produces 
insightful self-discovery, “at least now I understand where my reaction was 
coming from.” Of importance now is her realization that she requires help in 
dealing with this issue of control, “Please share with me any insights you may 
have about needing to feel ‘in control’ in the classroom.” She is beginning to 
reassess her image of the teacher. 

In her final focus on control, April appears to be on the road to resolving 
her dilemma. As her next journal entry shows, she has begun to experiment 
with new concepts about teacher as controller, a strong indication that she is 
beginning the process of breaking away from strongly held concepts. 


February 24, 1988 
Episode 


I collected from all students the questions which they had written and 
randomly chose 5 of these to read aloud to the class. They were to answer each 
of these in their notebooks as I read them aloud. Once everyone had written 
answers for the questions, we went through them one by one evaluating the 
quality of the questions. The students told me what they thought was good or 
bad about the question and why. Their evaluations were based on how 
difficult or easy it was to answer the question, and to what degree it forced 
them to think. I wrote all of these notes on the blackboard and by the time we 
had finished, there were some recognizable trends in what had come out. I 
then developed a definition of the term criteria with the class and told them 
that from their analysis of the questions, they could develop a set of criteria 
for writing good questions. They came up with about 10 criteria which I wrote 
on the board and they wrote in their notebooks. To close the lesson I reviewed 
what we had done and we talked about the fact that the students themselves 
had basically run the whole lesson, and I as a teacher had simply clarified and 
organized what they already knew. There was a sense of pride among the 
students that they had been able to teach themselves. 


From April’s thoughts in this journal entry may be seen a shift in her 
image of the teacher. April appears to be discovering that control has dif- 
ferent meanings than being directive and authoritarian. Involving students 
in choice behavior had its obvious benefits: “There was a sense of pride 
among the students that they had been able to teach themselves.” 


Analysis 


I had such a good feeling after this lesson! The students had controlled and 
run practically the whole lesson, and I could actually see learning taking 
place. The content of the lesson was simple but the students were working at 
a high cognitive level (evaluation) and seemed to feel some “ownership” over 
their learning. It was such a neat feeling to flow with the students and to tap 
their resources. 


This experience has been truly a valuable learning one for April. She has 
recognized the students’ role in ownership, “the students ... seemed to feel 
some ‘ownership’ over their learning” and appears to be coming to an under- 
standing of the role of facilitation in the learning process—“It was such a 
neat feeling to flow with the students and to tap their resources.” 
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Learnings 


I just realized as I write this up that for an entire lesson I was able to hand 
over control to the students without feeling out of control myself! Wow—what 
a revelation! Why is it that I didn’t feel threatened or afraid by the fact that 
the students were directing the course of the lesson? I guess because I was 
controlling the framework of the lesson I felt comfortable allowing the con- 
tent and the process of the lesson to be controlled by any students. Have I 
finally discovered the concept of “shared responsibility” in a lesson? 


April’s coming to understand the benefits of relinquishing control is a 
critical moment in the developmental process in becoming teacher. She 
comprehends that her role as facilitator is quite different than her role as 
controlling teacher, “because I was controlling the framework of the lesson I 
felt comfortable allowing ... controlled by the students.” 

There is an even more significant difference in this last journal entry. As 
will be noticed, April did not report confrontation or challenge. Rather she 
expressed elation in her interaction with the students, mainly due it seems to 
the high degree of interactive teaching that she experienced, during which 
the students contributed significantly to the content in April’s framework. 
This leads to the question of what might have occurred had one or more 
students challenged April’s directions or had not taken the exercise as seri- 
ously as they apparently did. Subsequently, her comments about control in 
this context take on a different meaning than do previous comments. Unfor- 
tunately, due to circumstances beyond my control I was not able to follow up 
this issue with April. My main interest at this point would be to determine 
how April would now deal with instances of apparent defiance that will 
inevitably occur even in the interactive mode she now espouses. Although she 
has a long way to travel in her quest to be a competent teacher, April has 
made a major discovery about herself and her role. 

In summary, when April entered the project, she brought with her clear 
perspectives of the roles of students and teachers. These perspectives or 
propositions screened her perceptions of behaviors, both hers and those of her 
students, in the practicum experience. Analyses of April’s journal writings 
and subsequent discussions with her, following each practicum, suggest that 
April held the following propositions to be true: 

1. The teacher is the sole authority in the classroom. 

2. Students are in a position of obedience with regard to the teacher’s au- 
thority. 

3. The teacher’s policies are never to be questioned by students. 

4, The teacher will not retreat from stated policy. 

Then, as she occupied her role as student teacher, April believed that her 

policies were to be followed without question. These propositions and others 

formed April’s concept of the teacher as controller. April was forced to 

confront those deeply held views about students and teachers when she 

experienced student behaviors that challenged them. This confrontation 

process was formalized through journal writing. 

In interviews with April, at which time she and I discussed her views of 
the practicum experience and her role in it, April’s perceptions of herself as a 
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teacher emerged. There were noticeable changes in these images from the 
first journal entry and subsequent interview to the last. Toward the end of 
her practice teaching, April began to see herself still as a controller but in the 
light of different roles for students. As she discovered, she could gain a degree 
of security through controlling the framework or context of the lesson, while 
allowing the students some control over the content and the lesson flow. 
This, she discovered, provided them with a meaningful sense of ownership 
and developed some measure of commitment to the lesson. 

I believe that these changes were facilitated in part by April’s confronting 
those propositions presented above through journal writing and by my guid- 
ing her through the change process. Thus, by the conclusion of her practicum, 
April had identified that the focus on control was blocking her development 
to teacher, had addressed the issue of the need for control, and had put into 
place strategies to deal with the need. Her February 24 journal entry is an 
indication that, although April had a long way to travel in her journey toward 
teacher, she had taken a positive step in her dealing with the need for control 
in the classroom and her concept of teacher as controller. 

Three issues arise from this discussion of April’s learnings from both her 
experiences in the practicum classrooms and her journal reflections. The first 
deals with the level of reflection employed; the second focuses on levels of 
competence; and the third issue deals with an emancipation process. 

April is reflecting at the technical level of reflection (Schén, 1983; van 
Manen, 1977) in that she is dealing primarily with the issues of student 
control and management from very much a technical perspective. That she 
has an egocentric focus in the analysis of her behaviors is indicative of this 
level of reflective thought. At this stage in her development to teacher, April 
was naturally more centered on her own behavior and dealing with that 
behavior and its concomitant beliefs than on the impact of her behavior on 
others. Although through our written and oral communications I attempted 
to refocus April’s attention and reflective thought to a curriculum focus, my 
efforts had little impact. Even her final comments about allowing students 
control were at the rational-technical level. 

In order to reflect more often at the curriculum impact level, April will 
need to become more comfortable with her role and more self-assured about 
her functioning as teacher. This development may occur naturally, given that 
in her first few years of inservice teaching she is assisted through her growth 
and that she continues her reflective analyses of classroom events and the 
teaching-learning cycle. 

As April and I communicated through the journal and through personal 
interviews, we discussed the idea of “letting go” and allowing students more 
control of their experiences and hence their learning. Associated with the 
propositions of control that April took into the classroom were notions about 
students’ abilities to manage their own learning, to be in part self-directed 
learners. Thus it seemed that propositions about teacher control were based 
on beliefs about students’ abilities to be responsible. By the end of her journal 
reflections, April appeared to have developed a greater understanding of the 
need to restructure the learning environment enabling students to exercise 
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more learning responsibility. However, this understanding must be develop- 
ed further so that it will become part of April’s practical knowledge of 
teaching and be translated into action. 

The question of emancipation is also relevant. In the context of April’s 
situation emancipation is defined, in the Freirean sense (Freire, 1981), as 
breaking away from preconceived beliefs and values that tend to enslave. 
April’s beliefs about herself as a teacher and her beliefs about control tended 
to enslave her to a particular perspective of classroom behavior and interac- 
tive style with students. In confronting these beliefs through journal writing 
and guided reflection, she was able to (a) recognize that these beliefs existed, 
(b) focus on the suitability of her beliefs about control and her role as a 
controlling teacher, and (c) understand the need to free herself from their 
constraints. This emancipating process then enabled April to approach un- 
derstandings of and to focus on her professional growth. Perhaps the final 
word on this matter should come from April herself: “This process of writing 
about (experience) has certainly helped me feel more peaceful about my 
situation. It always amazes me how writing can be so therapeutic!” 


Conclusion 
The process of becoming teacher requires more than a body of skill and 
knowledge and more than mere experience in a practicum classroom. As 
April’s case study suggests, there is a real need for becoming teachers to focus 
on the propositions they hold that form their concepts and beliefs about 
teachers, students, and the processes involved in interacting with students in 
classrooms. 

April’s case study is unique, as April is a unique individual. However, her 
situation as a becoming teacher is common, similar in many ways to her 
colleagues involved in the larger study and similar to the many case studies 
of beginning teachers cited in the literature. The importance of this realiza- 
tion, to my understanding, is that it heightens an awareness of the need to 
help becoming teachers focus on the concepts that they hold, concepts that, 
when fitted together, form schemata or frameworks for thinking about and 
behaving in the classroom. 

There is ample evidence in the literature (see Zeichner, 1980, for example) 
to suggest that neither the theory base gained in faculties or colleges of 
education nor the practice teaching experience itself will have much effect on 
a person’s becoming teacher. What does seem to make a difference is the 
internalization of both a theory base and practical knowledge into the teach- 
er’s existing belief and concept framework. 

We know that preservice teachers enter their training with sets of beliefs 
and concepts firmly developed (Wodlinger, 1985). April’s journal writing has 
confirmed this. If these schema do not change or change in incomplete or 
inappropriate ways, I speculate that the theory base and the learnings from 
the practica will be skewed to fit in the existing schemata, rather than 
changing the schemata to fit the newly discovered theory and practical 
learnings. 

Evidence from April’s writings and interview data lead to the speculation 
that she did not rise above the technical-rational (Schén, 1983) reflective 


129 


M.G. Wodlinger 


stage. In a real sense, April was not able to focus on any other level of 
reflectivity. With a limited base of experience and strong perspectives and 
values of herself as student, she was dealing with technical aspects of teach- 
ing all through her preservice year. 

The key to appropriate integration of new learnings in existing schema 
that are being changed to accommodate them, I believe, is intervention and 
mediation strategies that allow becoming teachers to examine their held 
beliefs and concepts and examine them in light of their visions of themselves 
as teachers. Guided critical incident journal reflection is one strategy that 
shows promise in effective mediation of held beliefs and concepts and learn- 
ings and in effective intervention that will allow for meaningful change and 
growth. 

This article has focused on the use of guided critical incident journal 
reflection with one preservice teacher. Through the case study approach, I 
have attempted to show the use of this tool in the development of teacher in 
a preservice program. The guided reflection process involves more than 
journal or diary writing. The added component of guidance by a significant 
other shows great promise in assisting the developing teacher in exploring 
the myriad experiences encountered during the training period. The use of a 
metaguide, a mentor—an associate teacher or faculty advisor—to question, 
probe, and, if needs be, confront to enable the preservice teacher to reflect on 
the behaviors in the classroom is essential. Such a relationship between the 
preservice teacher and a significant other will more readily enable the preser- 
vice teacher to derive meaning from experiences in the classroom and will 
assist in the integration of those learnings. 
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Intern Satisfaction with 
Cooperating Teacher Supervision 


This article reports the ratings of 126 interns of 31 present and ideal clinical 
supervision behaviors before and after a four-month internship to determine their 
perceptions of satisfaction with cooperating teachers’ supervision. Training and 
experience in supervision were also examined as related factors. The satisfaction 
measure of least difference between present and ideal scores indicated intern 
supervision satisfaction increased and moved closer to the ideal when interns and 
cooperating teachers experienced training and frequent practice in clinical super- 
vision. The most used and most wanted supervisory behavior was when the 
observer “stayed at least 30 minutes.” Audio, video recording, and charting 
physical movement continue to be the least used and least wanted supervisory 
behaviors. 


Student teaching is generally recognized as the most important preservice 
experience both by researchers (Brimfield & Leonard, 1983) and by teachers 
in their first few years of teaching (Clark, 1984). There is, however, little 
agreement on what constitutes best student teaching. Some researchers have 
suggested that the role of the cooperating teacher is crucial, and others have 
advocated the application of teacher effectiveness findings to the preservice 
experience. The College of Education, University of Saskatchewan, actively 
supported by the Saskatchewan Teachers’ Federation, has created a program 
that emphasizes these two aspects. In this four-month program, cooperating 
teachers and interns are trained and then practice clinical supervision and 
effective teaching, but is it the “best” student teaching experience? Hays 
(1982), at a conference on “Student Teaching: Problems and Promising 
Practices,” suggested when determining success in student teaching experi- 
ence the major research question should “examine the factors in student 
teaching that make it professionally satisfying or unsatisfying from the 
student teacher’s point of view” (p. 77). The questions that would examine 
the relationship of cooperating teacher, effectiveness, the nature of the su- 
pervision experience, and intern satisfaction remain unasked. 


Problem 
The conceptual base of satisfaction, supervision, and supervisory behaviors 
were identified and applied to student teaching and cooperating teachers’ 
literature. Herzberg’s (1968) job satisfaction work extended to teachers by 
Sergiovanni (1967) and recently reviewed by Perko (1985) formed part of the 
conceptual framework for this study. Job satisfaction was defined as the 
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extent to which institutional role expectations are congruent with personal 
need and disposition. Miskel, Glasnapp, and Hatley (1975, p. 40) provided the 
two-column structure for satisfaction measure with their conceptualization 
of job satisfaction as the difference between actual (present) experience and 
individual expectation (ideal). 

Clinical supervision defined by Acheson (1982) as supervision which in- 
cludes cooperative planning, observation, and feedback for the purpose of 
improving instruction provided the interaction frame. Krajewski (1982), 
Reavis (1977), and Acheson and Gall (1987) pointed to clinical supervision as 
the most positive form of teacher supervision. This was reinforced by Spell- 
man and Jacko (1988) who indicated that clinically supervised student teach- 
ers had more positive, effective internships than student teachers who were 
not clinically supervised. Therefore, 31 clinical supervision behaviors created 
by Shinn (1977), modified by Acheson (1982), and validated by Khalil (1983), 
Duncan (1984), Townsend (1984), and Smith (1986) were selected as items 
for construction of the “supervisory behaviors” section of the survey. The 
items covered the supervision cycle of preconference, observation, and post- 
conference. 

Teacher effectiveness findings have been used by Hunter (1980) and 
Rosenshine and Stevens (1986) to formulate technical approaches to teach- 
ing. These techniques, which are the core of the Saskatchewan internship 
model, advocate specific lesson formats and other procedures that provide a 
generic formula for teaching all subjects. 

Despite Blumberg’s (1980) conclusion that “the work of supervisors ... has 
little effect in raising the quality of instruction as a whole” (p. 234), Koehler 
(1986) and earlier researchers (Friebus, 1977; Seperson & Joyce, 1973; Yee, 
1969) have concluded that the supervision of the cooperating teacher is the 
key human element in successful student teaching. 

As both interns and supervising teachers in the sample had been in- 
structed and received coaching in supervision and effective teaching over 
four months, one aspect remained to be examined. Is supervision by the 
cooperating teacher a source of intern satisfaction? 

From this general concern to consider the practicum experience from the 
interns’ point of view, the researcher asked more specifically: 

1. What are interns’ perceptions of their cooperating teachers’ supervision? 

2. What are interns’ perceptions of ideal supervisory behavior? 

3. What supervisory behaviors bring intern satisfaction? 

4. Does intern satisfaction with supervision of interns vary as a function of 
training and experience in supervision? 


Significance 
The answers to these questions would be helpful in understanding forces that 
influence intern support or opposition to supervision; revealing the practices 
that may be maintained, incorporated, or deleted from current supervisory 
practice; and providing data for public statements that may guide policy, 
training, and implementation of more effective teacher preservice practice. 
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Methods 

The researcher used a survey to gain general dimensions and specific details 
about the level of intern satisfaction with supervision. It consisted of the 
“supervisory behaviors” questionnaire created by Shinn (1977) which 
measures teacher perceptions of present and ideal supervisor behavior before 
observation, during observation, and following observation of the class. It 
employed two Likert-type, five-point scales ranging from “never” to “often,” 
one on the left margin of each question for “present” responses and the other 
on the right margin for “ideal” responses. To establish baseline data, the 
survey was administered to interns by college supervisors at the first in- 
tern/cooperating teacher training seminar. The survey questions prompted 
interns to recall previous student teaching supervision and to identify ideal 
supervision. At the conclusion of the internship, the college supervisors 
readministered the survey during the final school-site observation to deter- 
mine changes after intern training and four-months’ experience in super- 
vision for effective teaching. 


Sample 
In the fall 1988 semester, 6 of 10 intern groups were selected. Initially 136 
(45 males, 91 females) elementary and secondary interns responded and 126 
(44 males, 82 females) completed the postinternship surveys. On the basis of 
subject or grade preference, one third of the interns were placed in urban 
settings. The remaining interns were placed in rural schools to the north and 
west of Saskatoon. 

Interns observed, assisted, and gradually increased their teaching load 
from one lesson per day to two weeks of full-time teaching in the final weeks 
of the internship. Intern growth was guided by monthly seminars dealing 
with planning, classroom management, questioning, reacting, presenting, 
and supervision techniques. One or two of these effective teaching topics 
became the intern’s monthly teaching focus. College supervisors instructed, 
modeled, and provided practice for the pairs at the seminars and completed a 
cycle of supervision with each intern once a month. Each day, cooperating 
teachers supervised and took data on one formal lesson of their intern. 
Accurate records of all planning and observations were kept by each intern. 
In all interactions, interns were encouraged to display leadership, to select 
observation targets, and to reflect on their own teaching. 


Results 
Intern mean ratings of cooperating teacher supervision are displayed in 
Figures 1 and 2 and described below. For all measures, interns rated the 
observation phase of the supervisory cycle lowest and had greatest satisfac- 
tion with postobservation conferences. The most consistently low-ranked 
supervisory behaviors for all measures (preinternship, present and ideal; 
postinternship, present and ideal) were “audio and video recordings” and 
“charting physical movement.” They were also least satisfying for preintern- 
ship and postinternship when satisfaction was measured as the least dif- 
ference between present and ideal means. These behaviors were rated as 
rarely used and ostensibly seldom wanted by numerous teacher samples 
(Acheson, 1982; Khalil, 1983; Smith, 1986; Townsend, 1984). Interns and 
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Seldom 
Sometimes 
Usually 


1. Meets prior to visit 

2. Finds teacher objectives 

3. Finds expections of students 
4. Finds teacher concerns 

5. Involves T in choosing methods 
6. Identifies expected behavior 
7. Suggests observation techniques 
8. Suggests self-supervision 

9, Makes verbatim notes 

10. Writes teachers questions 

11. Writes student responses 

12. Records task analysis 

13. Charts student responses 

14. Makes audio recordings 

15. Charts physical movements 
16. Makes video recordings 

17. Observes problem child 

18. Records subjective feelings 
19. Stays for thirty minutes 

20. Meets after visit 

21. Gives direct advice 

22. Gives opinions 

23. Relates T's perceptions 

24. Encourages T inferences 

25. Clarifies with questions 

26. Encourages alternatives 

27. Accommodates T's priorities 
28. Listens more than talks 

29. See T's comments 


Pre Conference 


Observation 


Post Conference 


Present Ideal 


Figure 1: Mean ratings of supervisory behaviors 
Pretest—Interns 
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Seldom 
Sometimes 


1. Meets prior to visit 

2. Finds teacher objectives 

3, Finds expections of students 
4, Finds teacher concerns 

5. Involves T in choosing methods 
6. Identifies expected behavior 
7, Suggests observation techniques 
8. Suggests self-supervision 

9, Makes verbatim notes 

10. Writes teachers questions 
11. Writes student responses 
12. Records task analysis 

13. Charts student responses 
14, Makes audio recordings 

15. Charts physical movements 
16. Makes video recordings 

17. Observes problem child 

18. Records subjective feelings 
19. Stays for thirty minutes 

20. Meets after visit 

21. Gives direct advice 

22. Gives opinions 

23. Relates T's perceptions 

24, Encourages I inferences 
25. Clarifies with questions 

26. Encourages alternatives 

27. Accommodates T's priorities 
28. Listens more than talks 


Pre Conference 


Observation 


Post Conference 


Present Ideal 


Figure 2: Mean ratings of supervisory behaviors 
Posttest—Interns 
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supervisors may need further training in these techniques to make them 
common practice and to reduce the sense of threat associated with these 
techniques. Other supervisory behavior ratings are listed below under the 
subsections of the two surveys. 


Preinternship 


Present. The most frequently used preinternship behavior was “meets 
with me after observation.” This point was reinforced in a previous study by 
interviewed teachers who said “best observation experiences had immediate 
post conferences” (Smith, 1988). “Stays with me at least 30 minutes” was 
the next highest rating. This satisfies interns as most observations are of 
sufficient length to ensure fair assessment of the class. 

Ideal. The highest rated ideal supervisory behavior was also “meets with 
me after observation” followed by “meets with me before observation” and 
“gives advice to improve teaching.” Advice seeking is characteristic of most 
student teachers according to Glickman’s (1985) description of teachers who 
respond to “directive” supervision. 

Satisfaction. The supervisory behavior giving greatest satisfaction was 
also “stays at least 30 minutes.” This was followed by “gives opinion regard- 
ing teaching.” This infers that interns had as much subjective opinion from 
their cooperating teachers as they wanted. 


Postinternship 


Present. The most frequently used supervisory behavior perceived by 
interns at the postinternship stage was “acknowledges my comments.” This 
suggests a prevalent nondirective supervisory style recommended by Glick- 
man (1985) and Sch6n (1987) where interns are empowered to reflect on 
their own practice. Interns were also usually receiving “praise and encour- 
agement” and had cooperating teachers who “found out intern’s concerns.” 

Ideal, The highest rated ideal items also reflected the consideration be- 
haviors wanted from the supervisors. The top supervisory behaviors were 
“finds out my concerns” and “acknowledges my comments.” “My” was the 
dominant word suggesting postinternship interns focused on concerns for 
self. They remain at Fuller’s (1969) first level of professional maturity. 

Satisfaction. Greatest satisfaction came to interns when supervisors 
“stayed at least 30 minutes” and recorded students “at task behaviors.” This 
second behavior was a source of relief to interns who, like Levy’s (1987) 
first-year teachers, frequently had fears of classroom management problems. 

The least satisfying supervisory behavior after “audio and video record- 
ing” was when cooperating teachers did not “suggest a variety of observation 
methods.” This may imply that interns are most anxious to know about all 
aspects of their teaching and that cooperating teachers need further observa- 
tion techniques training. There is, for example, no provision for qualitative 
observations in the internship model represented in the present study. 


Prerelationship/Postrelationship 


Overall satisfaction with present supervisory behaviors increased and moved 
closer to the ideal when interns and cooperating teachers experienced mutual 
training and had frequent practice in clinical supervision over the internship 
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(compare Figure 1 with Figure 2). The increased satisfaction supports 
Blumberg’s (1980) observation that multiple observation visits are needed to 
effect real change in teacher behavior, and Townsend’s (1984) contention 
that respondents, trained and experienced in supervision, had a high level of 
satisfaction with supervision. 

Interns in this sample, in agreement with Krajewski (1982), Reavis 
(1977), and Acheson and Gall (1982), are apparently verifying the need for 
supervisors to build on foundations of trust, respect, and cooperation with 
their supervisees. 


Conclusions 

Perceptions of intern satisfaction with supervision can be measured by rep- 

licating results from related studies and by comparing present and ideal 

perceptions of supervisor behaviors before and after internship. Training and 
experience in clinical supervision were also related significantly to satisfac- 
tion. The results suggest the following: 

1. College supervisors should provide school-based training in audio and 
video recording and charting physical movement. Then, further study of 
these behaviors should be initiated to determine if they are or are not 
suitable school-site observation techniques. 

2. A wider variety of other observation methods should also be taught to 
meet the diverse needs of interns. 

3. Mutual training for interns and cooperating teachers in clinical super- 
vision and time to implement this supervision model does result in in- 
creased satisfaction with supervision. 

Supervision does continue to elicit mixed reactions. While its ideal pur- 
pose of helping teachers improve instruction is applauded, its actual practice 
is frequently frustrated. The potential is often not realized. 

Conditions of this supervisory practice that made it satisfying to interns 
were training and experience. Specific training factors included mutual su- 
pervision training for interns and cooperating teachers; ongoing coaching 
and consistent modeling of clinical supervision by the cooperating teachers 
and the college supervisors; and a clear goal of mastering effective teaching 
behaviors that gave interns a reason to practice supervision. 

Finally, extensive experience was provided that allowed the interns and 
cooperating teachers time to develop supportive collegial relationships and 
time for interns to see the results of supervisory practice in their own 
instructional improvement. 

Utilization of these recommendations will bring improved supervisory 
behaviors, increased intern satisfaction with supervision, and ultimately, 
improved instruction for students. 
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Teacher Participation in 
Curriculum Development: A Study 
of Societal and Institutional Levels 


It is common practice to include teachers in curriculum committees operating at 
the societal and institutional levels of the educational system. However, the two 
levels differ in their distance from the classroom and in their responsibilities for 
curriculum development. This study compared the responses of teachers par- 
ticipating in curriculum committees at the two levels. Differences were found in 
the teachers’ motivations to join their committees as well as in the satisfactions 
and dissatisfactions which they derived from participation. 


Curriculum theorists approach the study of curriculum today from two major 
vantage points. One group is philosophical in orientation, reflecting on the 
nature of the educational experience and its place in the social and political 
scheme of things. Books and articles written by members of this group tend 
to be scholarly and theoretical rather than practical in nature. The other 
group approaches the study of curriculum from a managerial standpoint. 
Attention is given to the administrative aspects of curricuium, including the 
selection, organization, and supervision of people involved in curriculum 
decision making. Publications often focus in a practical way on curriculum 
change and how that change can be facilitated. 

The study described in this article stems from the second orientation. It 
describes the responses of teachers to participation in curriculum committees 
with the intention of offering practical suggestions to administrators who are 
responsible for the development of curricula. 

The train of thought which led to the study began with the work of 
Goodlad and Associates (1979) who conceptualized curriculum development 
as occurring at three levels of an educational system: societal, institutional, 
and instructional. The societal level is the most remote from the teacher’s 
daily work. It includes federal agencies as well as state or provincial depart- 
ments of education. The institutional level includes both the school district 
and individual schools, while the instructional level refers to individual 
teachers or teams of teachers working with students in their own classrooms. 

The study presented in this article focused on the societal and institution- 
al levels. At both levels, it is common practice to form committees to develop 
curricula, and those committees ordinarily include classroom teachers. Ini- 
tially the inclusion of teachers may have been a political maneuver for, as 
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Bidwell (1965) points out, “The policies or rules determined by these commit- 
tees gain at least indirectly the legitimacy of teacher consent” (p. 1015). But 
more recently the growing professionalization of teachers has drawn atten- 
tion to the contribution that teachers can make to curriculum committees. 
Teachers are recognized as having specialized knowledge (Carson, 1984; 
Connelly & Ben-Peretz, 1980) as well as a practical orientation derived from 
their work in classrooms (Doyle & Ponder, 1977-78). As Carson and Friesen 
(1978) explain, “It is generally felt that since teachers possess professional 
knowledge they should be involved to a considerable extent in educational 
decision-making in order to practice their expertise as well as to protect their 
professional interests” (p. 2). 

However, although the participation of teachers in curriculum commit- 
tees is perceived as desirable, the responses of teachers to participation are 
mixed. Positive responses seem to be contingent on the kinds of decisions the 
teachers will be making (Ponder & Bulcock, 1976) as well as the teachers’ 
stake in those decisions and their belief that they have the necessary exper- 
tise to make the decisions (Hoy & Miskel, 1978). Positive responses also 
depend on the teachers’ perception that their participation is genuinely 
solicited and is not simply a formality (Duke, Showers, & Imber, 1980). 

Another variable worthy of investigation is the level of the educational 
system at which the curriculum committees are working, for there are dis- 
tinct differences between curriculum development at the societal and institu- 
tional levels. A brief picture of curriculum development at the two levels is 
drawn below, with particular attention to the Canadian context in which this 
study took place. 

In Canada, the Constitution Act of 1867 granted to the provinces control 
of and responsibility for education (Lawson & Woock, 1987). Thus societal- 
level curriculum committees sponsored by a provincial department of educa- 
tion develop curriculum policy for the schools of the province. They express 
this policy in printed guidelines which define what students ought to be 
learning in the various subject areas at each grade level (Werner, 1987). 
Goodlad, Klein, and Tye (1979) refer to these guidelines as the “formal 
curriculum,” that is, a curriculum which has official approval. The com- 
mittees’ work is shaped by debates and trends in the larger society (Werner, 
1987). Their work is also influenced by the degree of centralization versus 
decentralization which is currently in vogue. The guidelines they produce 
may be highly content specific or may provide general statements or sugges- 
tions on the assumption that local boards and teachers will fill in the details 
(Fullan, 1982). 

Curriculum committees at the institutional (district and school) level are 
responsible for implementation of the provincial guidelines. Although the 
department of education exercises central control, the committees have sub- 
stantial scope to determine content and material, pedagogical methods, and 
assessment of students’ progress (Dickinson, 1987). Committee members 
must interpret the provincial guidelines for the local situation. According to 
a study by Robinson (1982), this may not be easy due to problems inherent in 
the guidelines such as a lack of clarity and a dogmatic stance which dis- 
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courages reflective and deliberative activity at the institutional level. The 
committees receive little assistance from the province as they endeavor to 
interpret provincial curricula. This situation, while frustrating, is consistent 
with the concept of loose coupling (Boyd & Crowson, 1981; Meyer & Rowan, 
1983) which Larocque and Coleman (1985, p. 149) describe as a “mutual 
noninterference pact.” 

As these brief descriptions show, curriculum committees at the societal 
and institutional levels differ in certain respects. It is possible, then, that 
teachers are motivated to join curriculum committees at the two levels for 
different reasons, and they may derive different satisfactions and dissatisfac- 
tions from the experience. If it is important to include teachers on commit- 
tees at both levels (and there are both political and practical reasons for doing 
so), it would be helpful to know if this is the case. Administrators responsible 
for the operation of curriculum committees at the two levels may find the 
data particularly valuable. 


A Description of the Study 
The purpose of this study was to compare the responses of teachers to 
participation in curriculum committees at the societal (provincial) and in- 
stitutional (district and school) levels. The following research questions 
guided the study. 
1. Are teachers’ motivations to participate in curriculum committees 
similar or different at the societal and institutional levels? 
2. Are the satisfactions teachers derive from participation in curriculum 
committees similar or different at the societal and institutional levels? 
3. Are the dissatisfactions teachers derive from participation in curriculum 
committees similar or different at the societal and institutional levels? 
It was decided that a descriptive study of teacher-participants would be 
appropriate at this exploratory stage. 


The Study Sample 


The study was conducted in two phases in the province of Alberta, Canada. 
Phase I focused on the societal level of curriculum development, that is, on 
the provincial department of education which is known as Alberta Education. 
Lists of curriculum committees established by the Curriculum Branch of 
Alberta Education were obtained. Fifteen current committees were iden- 
tified whose membership included full-time teachers. One teacher was ran- 
domly selected from each of those committees. Eight of the teachers were 
members of ongoing curriculum coordinating committees, while the other 
seven teachers served on ad hoc committees. The teachers were drawn from 
many parts of the province because Alberta Education makes a point of 
ensuring geographic representation on its committees. 

Phase II extended the research to the institutional level. Broad geographic 
representation was also desirable at this level because Alberta is large and 
school districts may differ in their curriculum developmert policies and 
procedures. Because it was not feasible for the researcher to travel through- 
out the province, it was necessary to locate teachers from across Alberta who 
were currently residing in one area. This was accomplished by asking grad- 
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uate students in the Faculty of Education at the University of Alberta if they 
had served on curriculum committees in their school districts in the past two 
years. All 16 teachers who had done so participated in the study. Eight of the 
teachers had served on district-level committees, while the other eight had 
served on school-level committees. 

The provincial and local groups were similar in two respects. Geographic 
representation was achieved in both groups, and the teachers were drawn 
from all three levels of schooling: elementary, junior high, and senior high. 


Data Collection and Analysis 


A structured interview was conducted with each teacher. The interview 
questions dealt with four major categories of information: (a) a description of 
the committee and the way in which it functioned; (b) the background which 
the teacher brought to the committee; (c) the teacher’s reasons for joining the 
committee; and (d) the teacher’s responses to working on the committee. 

In order to obtain valid data, the following procedures were employed. 

1. The length of the interviews ranged from 60 to 90 minutes, allowing suf- 
ficient time to explore the questions in depth. 

2. Interviews were conducted privately with no interruptions. 

3. Teachers were assured of anonymity in the reporting of their responses. 

4. The use of an interview guide ensured that the same questions would be 
asked in the same order. 

5. Probing questions encouraged teachers to expand on their responses. 

6. The questions focused primarily on current or recent events which were 
fresh in the teachers’ minds. 

7. The interviews were tape-recorded, allowing the researcher to con- 
centrate on the teachers’ responses and ask probing questions rather 
than taking notes. 

The tape recording of each interview was transcribed. This allowed ac- 
curate reporting of a teacher’s responses and enabled the researcher to 
interpret specific responses in the context of the entire transcript. The 
responses of the teachers were then collated for each interview question. The 
length of the responses ranged from a sentence to many paragraphs. In each 
case, the researcher extracted the essence of the response from the teacher’s 
words in order to facilitate compilation of the data. 

For example, two teachers described their motivations to join their com- 
mittees in this manner. “I needed to have the interaction with some col- 
leagues or professionals whom I could relate to.” “I think it’s really 
important for teachers to talk to other teachers about what they are doing.” 
Both of these statements were summarized as “Need for interaction with 
other educators.” This summary statement was then used as a category of 
response, and the number of teachers whose responses fell into that category 
was recorded in a table. 

Tables 1, 2, and 3 present the teachers’ motivations, satisfactions, and 
dissatisfactions respectively. In order to keep the tables to a reasonable size, 
categories were listed only if they included responses from at least one 
quarter (i.e., four) of the teachers at either the provincial or local level. The 


144 


Teacher Participation in Curriculum Development 


Table 1 
Teachers’ Motivations for Participation in Curriculum Committees 
See eee Le ee est Lord) tlhe otatn ov TS oe 
Number of Teachers 


Motivations Province Local Total 
Incas n=16 n= 


Coordinating Ad Hoc Subtotal District School Subtotal Total 
Desire to be involved 


in decision making 5 5 10 1 5 6 16 
Acquisition of information, 

ideas, and materials 0 2 2 3 3 6 8 
Curiosity about the curriculum 

development process 3 3 6 0 0 0 6 
Interest in the specific 

tasks of the committee 0 0 0 3 2 § 5 
Need for interaction 

with other educators 0 0 0 4 0 4 4 

Table 2 


Teachers’ Satisfactions with Participation in Curriculum Committees 


Number of Teachers 


Satisfactions Province Local Total 
Gia ibe n=16 n=si 


Coordinating Ad Hoc Subtotal District Schoo! Subtotal Total 
Interacting with 
other educators 7 7 14 if 5 12 26 


Growing professionally 3 1 4 6 4 10 14 
Influencing curriculum 

decisions 3 1 4 2 3 5 9 
Enjoying the tasks 

performed by the committee 2 1 3 z 3 5 8 
Making a contribution to the 

work of the committee 5 2 7 0 0 0 7 


descriptions of teachers’ responses which appear in the following sections are 
based on the information in the tables. 

The data directly related to the three research questions are presented in 
this article. Other data, drawn from the same study, have been used to 
explore the status of curriculum development at provincial and local levels 
(Young, 1988); factors related to teacher interest in curriculum commitees 
(Young, 1989); and curriculum implementation from an organizational per- 
spective (Young, 1990). 

Findings 
The findings of the study are divided into two sections: societal (provincial) 
level and institutional (district and school) level. To facilitate understanding 
of the teachers’ responses, each section begins with a description of the 
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Table 3 
Teachers’ Dissatisfactions with Participation in Curriculum Committees 


Number of Teachers 


Dissatisfactions Province Local Total 
n=15 n=16 n=31 


Coordinating Ad Hoc Subtotal District School Subtotal Total 
Hierarchical and 


political constraints 3 6 9 6 0 6 15 
Slowness and inefficiency 

of committee work 4 on 7 1 6 7 14 
Lack of time for 

committee work 0 0 0 6 5 11 11 
Financial constraints 1 4 LS, 0 0 0 = 
Difficulty with the work 

of the committee 0 0 0 2 2 4 4 


curriculum committees operating at that level. It then continues with the 
teachers’ motivations for joining their committees as well as the satisfactions 
and dissatisfactions they derived from participation. Occasional quotations 
from the transcripts are used to exemplify the teachers’ responses. 


Societal (Provincial) Level 


Curriculum committees at the societal (provincial) level operated under the 
auspices of the provincial department of education, officially called Alberta 
Education. Two types of committees were formed: curriculum coordinating 
committees and ad hoc committees. 

Descriptions of the committees. Eight of the teachers were serving on 
curriculum coordinating committees. These were ongoing committees with a 
gradual change of membership over time. Alberta Education made a serious 
effort to include broad geographic representation as well as a variety of 
educators on the committees such as university professors, school admin- 
istrators, subject-area consultants, and classroom teachers. The teachers 
were selected by Alberta Education personnel from a name bank of inter- 
ested teachers provided by the Alberta Teachers’ Association. Alberta Educa- 
tion paid the members’ expenses for serving on the committees plus a daily 
honorarium. 

The curriculum coordinating committees were involved in the develop- 
ment of policy. All but one was affiliated with a particular subject such as 
science or language arts taught in the schools of the province. The exception 
was a committee developing policy for integrating learning resources into the 
process of curriculum development. Words used to describe the work of the 
committees were “establish,” “restructure,” “update,” “overhaul,” and “pro- 
vide direction.” The importance of this work was confirmed by a teacher who 
said about her own committee, “I was just totally shocked or surprised at how 
powerful that committee is.... Basically, we set down what they were to do 
and how to do it. Kind of where to do it. How long to do it.” 
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However, as Werner (1987) points out, “Policy needs to be clothed with 
concrete content and activities” (p. 93). Therefore, having established policy 
for the various subject areas, the committees turned their attention to such 
matters as the selection of specific content, the sequencing of content, the 
development of instructional modules, and the selection of resources for 
classroom teachers to use. The curriculum coordinating committees often 
formed ad hoc committees to perform these tasks. 

Seven teachers were members of ad hoc committees. The teachers were 
ordinarily asked to join the ad hoc committees without reference to a name 
bank or concern for geographic representation. They were simply known, 
personally or by word of mouth, by Alberta Education or curriculum coor- 
dinating committee personnel for their interest or expertise in the subject 
under study. There was a higher percentage of classroom teachers on the ad 
hoc committees than on the curriculum coordinating committees, perhaps 
because they were often dealing with tasks related to implementation. When 
the ad hoc committees had completed their tasks, they were disbanded. 

Motivations. The primary motivation for teachers to join provincial cur- 
riculum committees was a desire to be involved in decision making. The 
teachers recognized Alberta Education as the primary source of the curricula 
they had used for years in their own classrooms, and they had reached the 
point where they wanted to have some input into those curricula. As one 
teacher explained, “I had certain ideas and I wanted to see some of those 
ideas eventually flourish throughout the province. The only avenue open was 
to see if I could get on the curriculum committee.” 

A secondary motivation for teachers on provincial committees was curios- 
ity about the curriculum development process. “I was curious,” said one 
teacher. “I wanted to see how these curricula get developed.” Another teach- 
er explained that she joined her committee “To find out how things come 
about that filter down to us in the classroom.” This motivation is consistent 
with the teachers’ desire for involvement in decision making, for effective 
participation depends on an understanding of the curriculum development 
process and how it works. 

As the teachers discussed these two motivations for joining their cur- 
riculum committees, it became apparent that a great gulf existed between 
Alberta Education and the teachers’ classrooms. The teachers simply did not 
know what Alberta Education did or how its work was performed. As one 
teacher explained, “I really didn’t have any idea what I was getting into. 
What it would involve.” Another teacher said that, when he joined his com- 
mittee, he thought, “Golly, I wonder what is happening, and in hopes that, 
once I found out, I could contribute something.” 

As a result, it came as a pleasant surprise when the teachers found they 
possessed a unique, practical knowledge of classrooms which other members 
of the committee did not have. This knowledge, they said, was the greatest 
asset which they brought to their committees. As one teacher explained, “T 
think that the experiences that a classroom teacher has are so different. I 
believe that, if you get away from the classroom for too long, you sometimes 
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really lose touch with the reality of what is happening within that class- 
room.” 

The teachers found, to their gratification, that their practical knowledge 
was welcomed by the other committee members. These people were school 
administrators, consultants, and professors, and the teachers apparently 
perceived these committee members as being of higher rank than themsel- 
ves. “I’m really quite surprised that they even listened,” said one teacher. “I 
guess I didn’t feel they’d listen to me as much as they do.” 

Satisfactions. When the teachers discovered that they were bona fide 
committee members, they began to enjoy the interaction with other people. 
In fact, such interaction was the greatest satisfaction they derived from their 
committee work. This satisfaction ranged from the joy of meeting new people 
to acquiring new ideas to having one’s own ideas challenged. As one teacher 
remarked, “We could get together and get to know each other as people.” 
Said another, “I love to sit and listen to the other members of the committee 
and find their opinion about things that I have concerns about.” Another 
typical comment was that “The give and take with other people with dif- 
ferent ideas is always useful. It helps you clarify some of your own thinking 
in certain kinds of ways.” 

The teachers also found considerable satisfaction in making a contribu- 
tion to the work of their committees. As one teacher explained, “I think you 
get a personal satisfaction out of it. You feel you’ve done something for the 
school system. For the kids. You’ve had some input. You’ve contributed 
something.” 

It is interesting to note, however, that the teachers did not identify their 
two greatest satisfactions—interaction with other educators and making a 
contribution— as reasons for joining their committees in the first place. This 
is consistent with their initial lack of knowledge about Alberta Education and 
the way it functions. 

Dissatisfactions. The teachers’ dissatisfactions also reflected naiveté 
about committee work at the provincial level. To begin with, there was 
frustration with the slowness and inefficiency of committee work. For ex- 
ample, the teachers spoke negatively about the rehashing of decisions. This 
was partly due to the need to remind committee members of decisions made 
at their last meeting (which may have occurred several weeks earlier) and to 
apprise new committee members of the work already accomplished. But 
another cause was a lack of certainty about the route the committee was 
taking. As one teacher explained, “When they decided to go that route, they 
were still not ready to go that route wholeheartedly, so consequently we may 
end up with people wanting to redebate it again which will delay us again. To 
me that’s inefficient.” The teachers did not seem to be aware that, from a 
political standpoint, it may be desirable for all committee members to be in 
agreement about the course they are taking and to present a united front to 
the various interest groups, even if it does take additional time. 

The provincial teachers also expressed dismay with the hierarchical con- 
straints affecting their work. They discovered that, even though they were 
serving on committees at the highest level of the educational hierarchy, the 
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work they did was subject to review and revision. The teachers deplored the 
lengthy review procedure (which they referred to as “red tape”). The changes 
required of their committees ranged from revisions in the wording and 
format of curriculum documents to substantive changes in content. The 
teachers perceived the minor changes as unnecessary detail and the major 
changes as tampering with the validity of the curriculum which they had 
developed. As one teacher explained, “At every stage somebody who has not 
been on the committee and does not know what the thinking of the commit- 
tee is has the power of coloring the program and then adjusting the program 
which may in fact remove some of its validity.” 

The teachers were particularly dismayed when confronted with the power 
of the Minister of Education. For example, when a committee was working in 
a politically sensitive area such as social studies, the Minister had been 
known to step in with specific directives, and those directives had to be taken 
seriously. As a result, said one teacher, “You’ve now got to change horses in 
midstream.” This created a dilemma for committee members whose view of 
the subject area differed from the Minister’s. “I didn’t realize,” a teacher 
commented, “that one pronouncement from one man or even a government 
could just totally change us around.” 

When such basic changes were made in a committee’s work, the teachers 
registered considerable cynicism. As one remarked, “If someone else is simp- 
ly going to make up a program and move it on top of the one that the 
committee worked through, well, my first impression is “Why didn’t you do it 
in the first place?’” There was also skepticism regarding the source of the 
changes required. One teacher explained that, “The government, because it 
is political, they listen to the public and the public does not know exactly what 
[this subject area] is. They know what they were taught when they were 
students.” 

Another dissatisfaction surfaced when the Minister set “unreasonable 
time lines” which resulted in a lack of quality in the committee’s work, a lack 
of credibility with teachers in the field if the time line was not met, and a 
failure to develop sufficient materials for implementation of the curriculum. 
One teacher who had served on other provincial curriculum committees had 
drawn the conclusion that the Minister was uninformed about the amount of 
time needed to perform a particular task well. “He didn’t really understand 
the parameters,” she remarked. “I feel that his department has not made the 
parameters clear to him.” 

The teachers were also concerned—and perplexed—about a perceived 
lack of financial support for the implementation of the curricula they were 
developing. As one teacher explained, “You put together a series of ideas and 
then, damn it, the stuff isn’t there. The resources are not available for 
teachers.” The teachers themselves believed that insufficient financing was 
partly due to a lack of clarity on the part of Alberta Education regarding 
responsibility for curriculum implementation in the province. They evidently 
did not realize that the major responsibility for implementation lies with the 
institutional, not the societal, level. 
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Institutional (District and School) Level 


Goodlad and Associates (1979) conceptualized the institutional level as in- 
cluding both the school district and individual schools. In certain respects, 
the teachers participating in curriculum committees in both contexts re- 
sponded in a similar manner to participation. However, certain differences 
were noted as well. 

Descriptions of the committees. Eight teachers had recently participated 
in school district curriculum committees. These committees were formed 
without exception by central office personnel, usually a subject area consult- 
ant. The consultants were responding primarily to an Alberta Education 
mandate, to changes in the subject area, and to innovative ways to teach the 
subject area. They were also sensitive to the concerns expressed by teachers 
as they visited in the schools. 

Five of these teachers had been asked to serve on their committees by 
central office personnel. Two had heard about the committees by word of 
mouth and volunteered to serve, and one said participation was mandatory 
for all teachers in her subject area. 

The other eight teachers had served on curriculum committees in their 
own schools. These committees were formed in response to teachers’ frustra- 
tion with subject area curricula. In two cases there was a need to eliminate 
overlap from one grade level to another. Other committees were formed 
because insufficient or inadequate resources had been provided for imple- 
mentation of a subject area curriculum. The teachers, naturally, complained 
about these difficulties and, as one teacher explained, “It got to the point 
where the principal was picking up that there was a general consensus that 
this was happening between grades and felt it was time to really act on it.” At 
that point, a committee would be formed. 

Five of these teachers had volunteered to serve on their committees, while 
three participated because all the teachers in their grade levels or subject 
areas were involved. Collegiality was apparently an influence on participa- 
tion at the school level. As one teacher remarked, “Everybody was just sort of 
helping everybody else out.” Said another, “It seemed fair for each teacher to 
take part of the responsibility.” Committees at both the district and school 
levels were engaged in similar tasks. Some committees attempted to make 
the provincial guidelines more usable by straightening out what would be 
taught at each grade level. In the course of this work, objectives were added, 
rearranged, and emphasized. The other committees approached implementa- 
tion more directly by developing units, gathering resources, and pilot testing 
either a new curriculum or a series of textbooks. The only curriculum work 
which was unique to an individual school was the development of school-wide 
instructional themes such as “Alberta’s Mosaic.” 

Motivations. The teachers joined their committees for a variety of rea- 
sons. One motivation was the opportunity to acquire information, ideas, and 
materials with immediate application to their classrooms. The teachers look- 
ed forward to obtaining such tangible materials as science equipment, pack- 
ages of resources, and a clearer sequence of objectives. One teacher, for 
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example, said she anticipated “Putting something together at my grade level 
that I knew that I would be able to use.” 

A second motivation was interest in the specific work the committee was 
doing. When asked why she joined her committee, one teacher replied, “It’s 
rather a selfish reason. It pushes me into reading children’s books.” Said 
another, “I like working with drama.... And things like taping, I enjoy doing.” 
It is worth noting that these teachers, unlike their provincial counterparts, 
were able to anticipate what their committees would be doing and whether 
the work would be of interest to them. 

A third motivation—the need for interaction with other educators—was 
identified only by teachers serving on district-level committees. Apparently, 
some teachers felt isolated in their own schools, and they needed the stimula- 
tion of contact with teachers whom they didn’t see on a regular basis. As one 
teacher explained, “I felt that, if I wanted to be doing a good job in my 
classroom, then I needed that external contact to better provide for my 
students.” Perhaps the teachers serving on curriculum committees in their 
own schools did not mention the need for interaction because they were in 
touch with their peers on a daily basis. As one teacher said about her school, 
“The grade 1 group was very interactive anyway, and we sort of participated 
in everything to help everybody out.” 

A fourth motivation—the desire to be involved in decision making—was 
more important to teachers serving on school-level committees than on 
district-level committees. The teachers working on committees in their own 
schools saw clearly that the decisions made by their committees would have 
considerable effect on their classrooms, and it was worth their while to swing 
the decisions the way they wanted them to go. As one teacher explained, “I 
had a basic idea of what should be included in a good reading program.... I 
didn’t want the focus to go totally onto the phonics.” Said another, “I didn’t 
want to find myself with a science text that I really didn’t like.” 

Why did only one teacher serving on a district-level committee identify 
the desire to be involved in decision making as a reason for joining his or her 
committee? A clue may be provided by the fact that the decisions made and 
the materials produced at the district level were optional for teachers 
throughout the district. Therefore, it is possible that the teachers did not feel 
themselves involved in decision making which would clearly influence what 
teachers did at the classroom level. 

One other finding is worth mentioning here. Curiosity about the cur- 
riculum development process was identified as a motivator by some provin- 
cial teachers. However, not one local teacher expressed such curiosity. It may 
be that the teachers had observed local curriculum committees in operation 
and knew—or thought they knew—how such committees functioned. 

Satisfactions. The teachers agreed with their provincial counterparts that 
the greatest satisfaction gained from participation on their curriculum com- 
mittees was interaction with other educators. Again, the teachers derived 
enjoyment from being with other people, from sharing ideas, and from 
realizing that other people had similar concerns. As one teacher explained, SL 
liked being able to talk to teachers who were also struggling in terms of 
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student motivation ... and also realizing that I wasn’t the only one in the 
whole show that was developing this same awareness regarding students.” 

The teachers also indicated that interaction on a curriculum committee 
was different from the daily interaction in the staffroom. For example, one 
said she enjoyed “Just getting to know the staff members that much better.... 
More comes out on a curriculum committee about another teacher’s phi- 
losophy of teaching than any other time. So much of the staffroom banter is 
just very specific, you know—‘Well, what did you do this morning and how 
did that work?’—whereas if you sit down with people on a curriculum com- 
mittee, they really come across with what they think should be in this part 
and why. I found that interesting.” 

Professional growth was another major satisfaction which the teachers 
derived from participation. They were clearly thinking in terms of their own 
classrooms. For example, some teachers welcomed the opportunity to study 
the curriculum in more depth and to synthesize their ideas. The teachers also 
found it helpful to put their classrooms in the broader context of the school. 
As one teacher remarked, “You have some sort of grasp of what is happening 
in your school when you are on a committee like that.” 

Dissatisfactions. The teachers serving on district and school committees 
expressed different dissatisfactions in two respects. 

Teachers serving on district committees, like their provincial counter- 
parts, registered dissatisfaction with hierarchical constraints. They found, 
for example, that their work was vulnerable to changes at the provincial level 
such as revised objectives for a subject area. As a result, “Certain programs 
have to be scrapped or entirely rewritten because they no longer conform to 
those objectives.” The teachers also felt constrained by central office which 
had the power to give released time, to set time lines, to determine the kind 
of work the committees would do, to establish the format of the materials 
produced, and even to require a teacher’s participation on a committee. In 
one case, a teacher perceived that his committee was a “rubber stamp” in 
which the teachers began to feel, “Well, what does he want us to say now?” It 
is interesting to note that teachers serving on committees in their own 
schools were apparently free of such hierarchical restrictions. 

On the other hand, the teachers at the school level, like their provincial 
counterparts, registered frustration with the slowness and inefficiency of 
committee work. However, this frustration was caused not by bureaucracy, 
but by other committee members. Unfavorable comments were made about 
teachers who had different ways of working or different perspectives or who 
failed to do the work they had promised to do. For example, one teacher 
found considerable difference between her interest in “how the concepts 
were developed and built upon” and the other teachers’ concern for “Can I do 
it with the kids? Is this going to work?” Such differences caused time delays, 
disappointment “when somebody else’s ideas are used, and they may not be 
as good,” and occasionally a negative feeling that one’s approach is con- 
sidered irrelevant by the other teachers. The failure of a committee member 
to shoulder his or her share of the load was also frustrating. “Not everyone 
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was available to come to meetings or, when they came, they didn’t have their 
things done.” 

Although there were differences in the dissatisfactions expressed by 
teachers serving on district and school committees, the major dissatisfaction 
of both groups was the lack of time to devote to their committee work. The 
problem was acute because these teachers ordinarily received little or no 
released time for their work, so they squeezed their committee meetings into 
free times such as noon hours, after school, and evenings. As a result, the 
work of the committees was not given sufficient priority and quality deteri- 
orated. As one teacher explained, “Quite often you would be just in the very 
middle of something and time would run out.” The teachers’ preparation for 
meetings also suffered. Said one, “It was really hard to be really thorough a 
lot of the time.” As a result, said another, “Sometimes the end product isn’t 
as good a product as it could be because the time lines and the energy 
resources available are too restricted.” At its worst, the teachers’ morale 
plummeted. As one teacher described it, “I was very busy, and I thought this 
is just something added to the whole array of things that I was doing, and 
sometimes I found that it was just a bit too much. I think that was the 
problem with most people.” (It is worth noting that the work of two school- 
level committees was never completed.) 


Discussion 

The purpose of this discussion is to offer practical suggestions to admin- 
istrators who are responsible for the operation of curriculum committees at 
societal and institutional levels. The suggestions are based on the finding 
that teachers serving on curriculum committees at societal (provincial) and 
institutional (district and school) levels did indeed differ with respect to their 
motivations for joining their committees as well as the satisfactions and 
dissatisfactions which they derived from the experience. 

It appears that, as curriculum development becomes more remote from 
the classroom, its character changes. It moves from practical to philosophi- 
cal, from self-regulated to regulated by others, and from personal to imper- 
sonal. Thus the teachers serving on societal (provincial) level committees 
were more curious than their local counterparts about the functioning of 
curriculum committees, more awestruck by the perceived power of the com- 
mittees, more delighted to find they could make a contribution to their 
committees, and more dismayed by the political and hierarchical constraints 
which controlled their work. However, they rarely complained about the 
feasibility of participation because released time was customarily provided 
for committee meetings. 

At the institutional (district and school) level, however, the teachers 
joined their committees because they expected to obtain practical informa- 
tion, ideas, and materials which they could apply to their classrooms. They 
knew—or thought they knew—how the committees would operate, so they 
expressed no curiosity about curriculum development or surprise when they 
made a worthwhile contribution to committee work. They were also in 
greater control of the work of their committees. Dissatisfactions, however, 
took on personal overtones, and committee work suffered from lack of time. 
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Three suggestions are made to administrators with regard to these find- 
ings. The first suggestion is that, as curriculum development moves up the 
educational hierarchy, an effort must be made to explain to teacher par- 
ticipants the context in which their committees will be working. For example, 
because teachers are not familiar with the machinations of provincial depart- 
ments of education, it is important to explain the constraints which will 
shape the work they do. In particular, the political need to satisfy a variety of 
internal and external groups and the bureaucratic need to follow certain 
procedures must be laid on the table. The data showed clearly that unex- 
pected and unexplained constraints led to negative feelings on the part of 
teacher participants, including the perception that they were being used for 
ulterior motives. These feelings may affect the quality of the teachers’ work 
and their willingness to serve on curriculum committees again. 

It is worth noting that this suggestion applies to district-level committees 
as well as provincial committees, although perhaps to a lesser extent. In fact, 
one of the interesting findings of the study was that, although Goodlad and 
Associates’ (1979) institutional level included both the district and the 
school, in many ways the district committees were more similar to provincial 
committees than to those in individual schools. For example, the committees 
were initiated and administered by people other than the actual participants, 
teachers were usually invited (or required) to join the committees, and the 
work of the committees was subject to.hierarchical control. 

The second suggestion is that, as curriculum development moves down 
the educational hierarchy, more support must be given to the curriculum 
committees. There is an obvious need for curriculum committees to be given 
time for their work as well as assistance so they can do their work well. 
However, the granting of time and assistance is important in another way; it 
says clearly to teachers that their curriculum work is of value. The need for 
recognition—for moral support—is particularly evident at the school level, 
for the closeness of committee work to the teachers’ classrooms made some 
teachers doubt that they were doing curriculum work at all. One teacher 
commented that when she joined her committee, “I wasn’t thinking of it as 
going in to doing curriculum work. I was thinking of it as a project that we 
might find fruitful to do.” Another teacher questioned that her committee, 
which was identifying and organizing the equipment needed at each grade 
level to implement the science curriculum, was really a “curriculum” com- 
mittee. “I think it’s just one of those things that has got to be done.” This 
particular teacher was interested in getting into curriculum work, which she 
called “the big stuff,” at the district or provincial level. The decrease in status 
as curriculum development moves down the educational hierarchy is a major 
problem for administrators to consider. (See Young, 1988 for expansion of 
this finding.) 

The third suggestion stems from the finding that the major satisfaction of 
teachers at both societal and institutional levels was their interaction with 
other educators. The interactors, the types of interaction, and the focus of 
interaction differed from one level to another, but in all cases the teachers 
clearly enjoyed sharing ideas and being exposed to approaches to curriculum 
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and instruction which were different from their own. It is suggested, then, 
that administrators capitalize on this need on the part of teachers. At the 
provincial level in particular, recruitment procedures should apprise teachers 
that interaction among committee members is one of the bonuses of commit- 
tee membership, for the data indicated that teachers did not know this would 
be the case. 

School administrators need to be aware, however, that interaction among 
teachers on school-level committees can be sullied by the fact that the teach- 
ers may know each other only too well. It was pointed out earlier that 
dissatisfaction at this level took on personal overtones. It is possible that 
interaction would be more positive at the school level if principals themselves 
took a more active role in committee work, awarded more status to cur- 
riculum committees, and arranged for committee members to interact with 
teachers in other schools and with consultants from central office. Such 
contacts would address the strong need the teachers expressed to interact 
with other educators on a professional basis. 
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There is widespread concern on the part of practitioners that curriculum consult- 
ants are not as effective as they could be. Researchers have tended to neglect the 
role: The behaviors that characterize effective consultants have not been identified. 
The purpose of this study was to bring to the surface some of these behaviors by 
focusing on differences in the strategies reported by experienced and inexperienced 
consultants. Data sources for the study were verbatim transcripts of semistruc- 
tured interviews conducted with 12 curriculum consultants (half experienced, half 
inexperienced) from two large school boards and six senior administrators from 
the same boards. Categories of analysis evolved from the data and were organized 
in a four-phase model of consultation: initiation, planning, delivery, and follow- 
up. This study found similarities and differences that were interpreted in an 
expert/novice framework. The characteristics that distinguished experienced from 
inexperienced consultants concerned overt strategies that were used to bring about 
change. There was little evidence that the crude quantitative measures (amount of 
consulting or time spent) identified in previous studies distinquished between 
experienced and inexperienced consultants, The study found that the tactics of 
consultants were affected by the settings in which they worked. The most sophisti- 
cated strategies were available only to consultants working in enlightened school 
systems. But in contrast with researchers who have argued that effectiveness can 
only occur in an enabling institutional context, the study concluded that many of 
the differences between experienced and inexperienced consultants were within the 
control of individuals. 


Motivation for the Study 
An educational consultant is an educator who helps teachers and other school 
system agents perform their responsibilities more effectively. Various labels 
are used to describe the role, and the specific expectations for consultants 
vary considerably from district to district. A review of the literature? found a 
vast amount of descriptive and prescriptive material and a relatively small 
set of empirical investigations. 
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Curriculum consultants are widely regarded as having a critical role in the 
improvement of educational practice, yet there is significant dissatisfaction 
with consultant performance. A major source of dissatisfaction is derived 
from consultants’ uncertainty about role expectations (Dow & Whitehead, 
1981; Stennett, 1974) and disagreements between consultants and superin- 
tendents concerning how consultants spend their time (Leithwood, Ross, & 
Montgomery, 1982). Another contentious issue is the relevance of consult- 
ants’ work. There is evidence that teachers’ educational concerns are hand- 
led within the school, not by consultants (Holdaway & Millikan, 1980) and 
that consultants spend a disproportionate amount of their time working with 
teachers least in need of assistance (Alpert, Weiner, & Ludwig, 1979). Then 
there are those who question the impact of consultants on curriculum 
decision making and their ability to bring about school change (Firestone & 
Corbett, 1981; Fullan, 1981; Regan & Winter, 1978). One result of dissatis- 
faction with the performance of consultants is the frequently observed prac- 
tice of reducing the number of consultant positions and/or filling them with 
short-term secondments. 

Most studies of consultation report that consultants have an impact on 
teacher skills and practice (Bancher, Eiseman, Cox, & Schmidt, 1982; Con- 
oley & Conoley, 1982; Curtis & Watson, 1980; Griffin & Barnes, 1986; see 
also the review of 29 studies by Medway, 1979). However, Curtis and Watson 
(1980) found that only highly skilled: consultants contributed to teacher 
growth. Effects on student behavior have also been reported (Bergan & 
Tombari, 1976; Conley, 1983; Mannino & Shore, 1975; Randolph & Wallin, 
1973); in contrast, Griffin and Barnes (1986) found that consultants’ work 
with teachers had no measurable effect on students. 

Several researchers have attempted to identify characteristics of consult- 
ation associated with teacher effects. The duration of consultant-teacher 
contact has been consistently linked to teacher growth (Curtis & Watson, 
1980; Louis, Rosenblum, & Molitor, 1981; Miles, Fullan, & Taylor, 1978; 
Runckell & Bell, 1976; Tyler & Fine, 1974). Investigators making finer 
distinctions with respect to the time variable report that effective consult- 
ants spend more time in inservice and modeling activities (Loadman & 
Mahan, 1972) and that consultant effectiveness is associated with how con- 
sulting time is used rather than the total amount of time given (Louis & 
Sieber, 1979). 

Studies attempting to identify differences in the effectiveness of stra- 
tegies used by consultants have had limited success. The Miles, Saxl, and 
Lieberman (1988) attempt to distinguish between strategies used by consult- 
ants, rated by project managers as either outstanding or average, found only 
small and inconsistent differences. Similar findings were reported by Fires- 
tone and Corbett (1981) and Louis and Kell (1981). The few intervention 
studies attempting to systematically vary the strategies used by consultants 
have reported scattered findings: Cossairt, Hall, and Hopkins (1973) found 
that praising teachers’ implementation efforts was more effective than other 
feedback strategies; Conoley and Conoley (1982) reported that joint consult- 
ant-teacher observation of students was more effective than having the 
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teacher alone describe student behavior; Bergan and Tombari (1976) found 
student and teacher effects for the number of different techniques recom- 
mended by the consultant to the teacher and for the extent to which the 
consultant elicited responses from teachers. 

Previous research has found that consultants can have an impact on 
teachers and students, but there is a lack of knowledge about the specific 
behaviors that distinguish effective from ineffective consultants. The search 
for effectiveness—in schools, teachers, principals, and supervisory officers— 
has not been matched with a quest for the effective consultant. What little is 
known about the characteristics of effective consulting is based on gross 
variables such as the effect of the presence of consultation or the amount of 
consultation. 


Focus of the Study 


The purpose of the research reported here was to identify some of the specific 
behaviors that distinguish effective consultants by focusing on differences in 
the strategies reported by experienced and inexperienced consultants. The 
rationale for using experience as a proxy for effectiveness was based on 
several considerations: 

1. In the region in which the research was conducted only a few consult- 
ants—those with particularly sterling records—were given a renewal of 
responsibility beyond a two- to three-year term. 

2. Supervisory officers reported that the major change in consultant effec- 
tiveness occurs in the first two years in the role. 

3. There were significant shortcomings in using other methods for select- 
ing respondents, for example, ethical review committees require that 
respondents selected on the basis of their reputations for ineffectiveness 
be so informed. 


Study Methods and Procedures 
Sample 


In Ontario the Ministry of Education provides a policy framework in the 
form of curriculum circulars, guidelines, and support documents for the 
development, implementation, and evaluation of instruction. Ontario school 
boards operate in the framework with substantial discretion (in comparison 
with other provinces). Board A in the sample promoted tighter linkages than 
other Ontario boards between its programs and Ministry documents on the 
one hand and between roles within the system on the other. This board 
developed a detailed set of structures, procedures, and a long-term operating 
plan (updated annually) to govern the actions of teachers, administrators, 
and support staff. Considerable emphasis was attached to coordinating the 
efforts of curriculum consultants with those of principals so that teachers did 
not receive competing messages about system expectations. Experience in a 
consultant role had recently become a prerequisite for promotion to prin- 
cipal. The system was less tightly coupled in practice than in theory—excep- 
tions to the intended model were easily found—but Board A was making a 
serious attempt to bring its standard operating procedures into line with its 
ideals. Board B in the study was less centralized. Responsibility for the 
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articulation and implementation of curriculum policy was inconsistently 
shared by line and staff incumbents. In some schools consultants and ad- 
ministrators worked together to give teachers a consistent message; in others 
consultants worked at the direction or with the indifference of the principal; 
in a few cases there was overt competition. This board was moving toward 
detailed procedures for curriculum action, but it was experiencing a gap 
between its senior echelon and middle management. In both boards there 
were a variety of consultant roles. Some roles had a subject focus for all 
grades (e.g., JK-13 Art); others focused on a subject for selected grades (e.g., 
Intermediate/Senior English) or addressed all subjects in a few grades (e.g., 
K-38). Psychoeducational consultants, unlike curriculum consultants, tended 
to focus on the needs of specific children in special populations and were 
more likely to spend a portion of their time delivering instruction to children. 
In each board some consultants had quasi-supervisory responsibility for 
other consultants and were more likely to work with groups of teachers than 
with individuals. 

The data consisted of verbatim transcripts from semistructured inter- 
views conducted with three experienced and three inexperienced consultants 
in each of two large Ontario school boards (n=12). Although participation in 
the project was voluntary, the consultants? were nominated and initially 
contacted by senior administrators. Characteristics of participating consult- 
ants are summarized in Table 1. 

These data were supplemented with other sources: 

1. A-series of pilot interviews was held with consultants from boards not in- 
cluded in the study in order to develop instruments. 

2. Single interviews were held with the director, superintendent of cur- 
riculum, and an area superintendent in each of the two boards. 

3. Materials provided by the consultants and administrators (job descrip- 
tions, examples of activities, and written products) were also examined 
to illuminate the context of the consultants’ work. 


Instruments and Procedures 


The study began with an unstructured group interview with four consultants 
not involved in the formal data collection. The purpose was to get a sense of 
the language and concepts used by consultants in talking about their work. 
Interviewees were invited to discuss successes and failures they had experi- 
enced relative to the statutory requirement that they “assist teachers in 
maintaining proper standards and improving instruction” (Government of 
Ontario, 1984). They were also asked to identify factors that contributed to 
success or failure in the role. For example, consultants said that “laid-on 
workshops” or “one-shot awareness sessions” rarely had the desired effect; 
much more successful were events that came in response to genuine teacher 
requests. The wide-ranging interview touched on a variety of issues such as 
how the objectives for a consultation are determined, relationships with 
principals and teachers, differences between one-to-one and group consult- 
ation, evaluation concerns, stressors, rewards, and images of the consultant’s 
role (e.g., “You know the story of Bartholomew Cubbins and the 500 hats 
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Table 1 
Summary of Main Consultant Sample Characteristics 


eee 


Item Experienced (6) Inexperienced (6) 
1. Number of years in At least 3 years for all six; Half year for two, 
present position more than seven years for four 0 years for four 
ee ee ee ee eae ee 
2. Consulting/ All six held prior consulting or Two held Department Head 
Administrative administrative positions positions; four had no 
experience prior to prior consulting or 
present position administrative experience 
3. Program areas and One concerned with all One concerned with a 
levels of present curriculum areas, K-6; specific area (Outdoor and 
responsibilities Five concerned with Environmental) K-13; three 
specific areas (Languages, responsible for all 
Arts, Gifted) with three curriculum areas K-6/K-8 
having K-13 responsibilities and two concerned with 
and two having 3-13 specific areas (Guidance, 
responsibilities Technical) 7-13 
4. Years of classroom Mean = 20 years Mean = 15 years 


teaching experience 


5. Academic background All six have undergraduate All six have undergraduate 
degrees and five have degrees and two have 
M.Ed. degrees M.Ed. degrees 


[Geisal, 1938]? Well, that’s us. Only every time you take one hat off, there’s 
about three more to take its place.”) 

The transcript from the three-hour interview was used to identify ques- 
tions and probes for a guide for interviewing individual consultants. This 
guide was piloted with two additional consultants—one experienced, the 
other inexperienced—not involved in the formal data collection. Further 
refinement followed analysis of these transcripts. 

The refined instrument (requiring 60-90 minutes to complete) was ad- 
ministered to the 12 consultants from Boards A and B. Respondents were 
invited to describe a situation in which they were particularly effective in 
one-to-one and in group consultation. For each example they were asked to 
identify the characteristics of the situation which contributed to success, the 
purposes served by the consultation, and to generalize from the case. They 
were also asked to describe how other school or board personnel affect their 
work, for good or ill, and to respond to some terms used by the pilot inter- 
viewees to describe their work with teachers (e.g., demonstrating, sharing, 
problem solving) and images for the role (e.g., Bartholomew, Geisal, 1938). In 
discussing their activities respondents described a three-part consulting pro- 
cess consisting of planning, delivery, and follow-up. Comments of inter- 
viewees were collected in each phase, and patterns were identified. 
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Unannotated transcripts from the first interview were returned to the inter- 
viewees for further elaboration. 

A second interview guide was designed to address areas of uncertainty. 
The second interview (also requiring 60-90 minutes to complete) focused on 
the three phases of consultation using the situations given by respondents in 
the first interview as stimuli. For each situation, specific actions taken by the 
consultant were probed in considerable detail. In the follow-up phase, for 
example, respondents were asked: Who determines follow-up needs? How are 
they determined? What kinds of activities constitute follow-up? Is follow-up 
provided for in the initial planning of the consultation? Is attention given to 
sequencing or recycling either in the initial planning or as follow-up progres- 
ses? Is there a decision that follow-up is proceeding satisfactorily or is com- 
plete? If so, who makes it and how? Unannotated transcripts of the interview 
were once again returned to the interviewees for comment and correction. 

Each consultant completed a self-administered questionnaire concerning 
professional experience, academic background, and training for the consult- 
ant role. They were also asked to bring any materials that would help the 
researchers understand individual consultant responsibilities. 

The interviews with senior administrators addressed the selection criteria 
for appointing consultants, performance expectations, training opportun- 
ities, and development anticipated in the role. The administrators in both 
boards looked for people with “the ability to have it happen in their own 
classroom and the ability to help somebody else have it [happen] in the 
classroom.” They wanted consultants who could conceptualize curriculum 
processes (“because they have to interact with people who are doing that all 
the time”), but they have to be practical (“capable of taking the concept and 
translating it into action”), aggressive (“requires a person who isn’t a shrink- 
ing violet”), and focused (“I see people being very selective about the work 
they do—being very purposeful in ensuring that there is some kind of collec- 
tive significance”). When responding to teacher requests, consultants are 
expected “to bring to that setting a system perspective and give them help 
that is perhaps broader than is asked for in the first place and is in keeping 
with the system direction and not totally in response to the initial question.” 
Administrators also received unannotated transcripts of their responses. 

Full-day meetings were held with the consultants from Boards A and B to 
obtain feedback on the interpretations of the data. Revisions were examined 
in subsequent half-day meetings. Shorter meetings were held separately with 
the administrators of each board. 


Coding and Analysis 


The analysis began by pairing Board A and B consultants so that each pair 
consisted of one experienced and one inexperienced consultant who differed 
as little as possible with regard to demographic features such as board, age, 
sex, subject area, and other factors. There were no perfect matches—there 
was, for example, a confounding effect of postgraduate training—but the 
procedure minimized sources of variation other than consultant experience. 
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Interview transcripts were compared to identify similarities and differen- 
ces. The search was guided by a framework which represented consultation 
as an instructional act for which objectives, teaching strategies, resources, 
and evaluation procedures could be identified. A more detailed account of the 
instructional system is contained in Robinson, Ross, and White (1985). 
Similarities and differences identified in each pair served as hypotheses 
which were tested in the transcripts of the remaining pairs. These hypotheses 
were not developed a priori—the existing literature provided insufficient 
direction for a confirmatory approach. In contrast the study was exploratory: 
Hypotheses were developed, discarded, and refined in successive iterations 
throughout the data. The decision rule for acceptance of each hypothesis was 
75% confirmation (i.e., the hypothesis was accepted if the data were consis- 
tent with the hypothesis in 9 of 12 cases). 


Findings 

Similarities Between Experienced and Inexperienced Consultants 

Although the focus of the study was on differences between consultants, a 
number of similarities emerged. Eight themes, summarized in Table 2, met 
the criterion for acceptance. (The checks in the table indicate that the 
respondent articulated a view that was consistent with the rest of the sample. 
The Xs indicate that the view expressed was inconsistent or that no data 
were provided; the question marks indicate ambiguous data.) 


Planning 


1. In setting objectives, all but one consultant indicated they listened and 
responded to teacher needs. For example, an inexperienced consultant re- 
ported that “We drew them [teachers] out to find out exactly what they 
wanted ... [consultants] really have to listen to what teachers want,” and 
another indicated how important it was “to be able to meet the needs of as 
many individuals as possible.” Similarly, experienced consultants said 
“teachers resent not being in on the planning” and consultants must be 
“prepared to listen to the teacher.” 

Getting teachers to express their needs could be a significant task. An 
inexperienced consultant “asked a lot of questions to establish exactly what 
they wanted.” An experienced consultant said she will “tend to hold off ... 
until I get enough feeling from the field that there are people who have 
identified their need.” Another described a consultation that failed “because 
it wasn’t coming from them ... it was coming from somebody up there.” 

2. In presenting new ideas to teachers, both groups of consultants used 
expert teachers. Experienced consultants talked about “bringing in teachers” 
and using “teams of teachers”; for example, “I have some teachers I bring in 
and we write curriculum and [they] do workshops for me.” Inexperienced 
consultants reported similar strategies: “planning is done with a group of key 
teachers”; “I say to them, if you have some people ... that have expertise, we 
need some volunteers.” 

3. The main method used by both groups to present how-to information to 
teachers was to provide hands-on activities: “direct activities that they could 
use and see how they would work.” Often these activities were brought to the 
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Table 2 
Similarities in Strategies Reported by Experienced and 
Inexperienced Consultants 


Experienced Inexperienced 
Dimensions Board A Board B Board A Board B 

Planning 

1. Consultants listened to teachers and responded VW VvVv VW xvV¥ 
to their needs. ; 

2. Consultants used expert teachers in presenting VW VW VXX VW 
new ideas to teachers. 

3. Consultants used hands-on activities to present vvV7 WX vv VW 
how-to information to teachers. 

4. Consultants used large group presentations to WV ov V/V VV 
provide general information about new 
programs. 

Delivery 

5. Consultants enlisted teacher motivation. VVV AMA Wx Vx 

6. Consultants tried to avoid coming on like 1 ey Wx VW VV 


experts to teachers. 


Evaluation & Follow Up 


7. Consultants found that principals had a V/V o/ WV WV 
big impact on the success of consultants. 


8. Consultants found that one shot workshops WV 2XV WV VV 
were not effective. 


Code: V consistent 
X inconsistent 
? ambiguous 


session by expert teachers. An inexperienced consultant had teachers spend 
“90% of their time sharing, sharing resources, sharing how do you do this, 
and how to you go about doing that.” A similar strategy was expressed by an 
experienced consultant who said “there are many different strategies you can 
use for workshops and they’re all hands-on.” 

4. The method preferred by both groups for presenting general informa- 
tion about new programs was the large group session. The view of experi- 
enced consultants that “the large group meetings [are] for situations where 
you have to get information out to the whole group at once” was shared by 
inexperienced consultants who reported that large group sessions were for 
when “there is a new program, new Ministry expectations, then obviously 
you want your teachers to be aware of what the expectation is”; “you want to 


b 


get the sense ... hey, we are all moving in the same direction.” 
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Delivery 


d. In delivering consultation both groups stressed the importance of gen- 
erating teacher motivation. Inexperienced consultants said that “a motivat- 
ed teacher is important for change”; they “have to see a reason why.” The 
experienced articulated similar views: “teacher interest is the key”; consult- 
ants have to provide “in-service to motivate teachers.” 

Part of the motivational dynamics reported by both groups is the develop- 
ment of a trusting relationship between the consultant and teachers. “Trust 
is the first thing” for one of the experienced consultants, a view echoed by an 
inexperienced consultant who talked about the importance of “the openness 
and trust and feeling [that] yes, we can work together.” 

6. Modesty in interacting with teachers was recommended by both groups: 
A consultant who “comes on” like an expert can alienate teachers. Inex- 
perienced consultants talked about efforts to counter or avoid the expert 
image: “This notion of standing up there as head honcho ... I think I delib- 
erately tried to move away from that perspective”; and from another, “I tried 
to establish sort of a relationship ... as a friend as well as a consultant.” Two 
experienced consultants making reference to the “expert” issue suggested 
that the consultant be careful “not to lord it over people ... [they] get turned 
off” and “clients can distrust if [the consultant is] seen as an expert.” 


Evaluation and Follow-Up 


7. Most consultants agreed that the principal could be a positive force or 
an obstacle mediating the impact of the consultant. An inexperienced con- 
sultant found that “the principal is a key factor in how the whole school feels 
about support people and having them in the school.” Experienced consult- 
ants provided similar comments: “the principal has a big impact”; “it’s 
important that the principal be on board”; “if [the] principal doesn’t go to the 
in-service [the consultant] might as well not go.” 

8. Both groups agreed that one-shot workshops are not effective. All 
inexperienced consultants talked about the limitations of one-shot work- 
shops with comments such as “I don’t believe a one-shot workshop effects 
change” and “obviously when you give a whole group presentation and leave 
... you are not very effective.” The experienced consultants endorsed this 
strategy commenting on the need for a “sequence of workshops” with “fol- 
low-up,” candidly dismissing the value of briefer events: “one [shot] sessions 
are useless.” 


Differences Between Experienced and Inexperienced Consultants 


Some variations in consultant behaviors were associated with differences in 
job descriptions. For example, experienced consultants had substantial devel- 
opment responsibilities in addition to implementation functions, whereas the 
job descriptions of the inexperienced consultants directed their efforts to 
putting the board’s curriculum into use. The study also found 10 differences 
that appeared not to be related to tasks assigned by the system but to the 
methods that consultants used to perform them. These differences (sum- 
marized in Table 3) tended to be smaller in Board A possibly because the 
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Table 3 
Differences in Strategies Reported by Experienced 
and Inexperienced Consultants 


Experienced (E) Inexperienced (I) 
Dimensions Board A Board B Board A Board B 


Planning 


1. (E) derived objectives from system plan; (I) VV VW X2V V/V 
had ad hoc initiatives based on own agenda 
& requests of teachers and principals. 


2. (E) provided consultation to a variety of Vv vv¥ XXV¥ VV 
roles; (I’s) consultation was limited to 
teachers. 


3. (E) individualized consultation to vi WA, VxXV a 
accommodate/a range of teacher needs; 
(I) treated all teachers the same. 


4. (E) provided practice activities within VVV Wiki Xx¥ ALL 
consultant sessions; (I) did not. 


5. (E) scheduled practice activities between VXV VXV VV WV 
consultation sessions; (I) did not. 


6. (E) arranged for teachers to share what they VvV. VX V2X V/V 
had learned with others as a strategy for : 
consolidating consultant input; (I) had 
no planned sharing or used sharing to 
introduce new ideas. 


Delivery 


7. (E) constructed multi-agent teams; (I) V/V VV VW XXV 
worked alone or with a single agent. 


8. (E) adjusted plans substantially in Vv VW Vxv VxV 
response to teachers feedback; (I) made 
modest adjustments or none at all. 


Evaluation & Follow Up 


9. (E) measured consultant impact in terms oVV VV VX¥ VW 
of teacher & student changes; (I) used 
looser, more affective measures. 


10. (E) used evaluation data to plan subsequent VV vx VV VV 
events; (I) focused on isolated events. 


Code: v consistent 
X inconsistent 
? ambiguous 


inexperienced had some experience in the consultant role, however brief, at 
the beginning of the study. In the table the checks indicate that the respon- 
dent articulated a view that was consistent with the rest of the group 
(experienced or inexperienced) to which he or she belonged. The Xs indicate 
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that the view expressed was inconsistent or that no data were provided; the 
question marks indicate ambiguous data. There were no instances of rever- 
sals, that is, contrasts in which the difference between experienced and 
inexperienced consultants was in the opposite direction to that found in the 
rest of the sample. 


Planning 


1. Experienced consultants used board priorities or a system plan to 
identify needs to be addressed in consultation: “there’s a plan ... you try to get 
them to plug into the plan and then if they move to grade 5 [or] down to grade 
2, then you look at the plan and say, all right, I have to expand this, I should 
be doing more of that.” The plan was used to develop a set of specific 
consulting objectives, in part by using the plan to interpret and select from 
lists of needs expressed by teachers. For example, one experienced consultant 
described how goals were set for a workshop series: The teachers were put in 
small groups and 


they were interacting about all the components of Whole Language ... then 
another primary consultant from the team and I sat down and took the levels 
of use profile titles.... We went through the raw data that we had collected 
from these people in their brainstorming sessions and we summarized every- 
thing that they had ... by doing that we were able to identify certain gaps. 


Experienced consultants responded to requests from individual teachers, 
groups of teachers, and principals, but they felt free to decline requests or 
give brief responses to those which deviated from the system thrusts. 


We don’t feel obligated to respond to the whims of principals when they don’t 
particularly fit into the initiatives. If we get that kind of request we may meet 
with the principal and try to put it into a different context. 


In contrast, the inexperienced consultants reported that the objectives for 
their activities were derived from their own personal agendas, their knowl- 
edge of teachers, or the expressed needs of consultees without overt reference 
to a system curriculum plan. For example, one inexperienced consultant 
reported that “the way I feel that I do things, I have to know people ... I would 
find it difficult if it was a very, very large board, I think that, with hundreds 
of teachers, then I don’t know what I would do.” Another inexperienced 
consultant indicated that his workshops replicate a provincial inservice 
program containing specific objectives: “It’s quite a program. Very prescrip- 
tive, pedagogy outstanding; it’s very tight.” 

Experienced consultants also searched more widely for data on teacher 
needs. This search for accurate data was most visible in one-to-one interac- 
tions. For example, 


usually before you go in you have a good idea of what the teacher would like 
you to do, so you do some phoning beforehand ... there’s your initial contact, 
you think about it, you phone back and talk some more. You also talk to the 
principal. That way everybody is brought on board and then you go in and you 
can help the teacher a little bit more because you have more information. 
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Conversations with teacher and principal were supplemented with observa- 
tional data collected by the consultant or principal to diagnose the specific 
needs more precisely. 

In contrast, inexperienced consultants employed narrower search proce- 
dures. They reported talking to the teacher involved in the one-to-one inter- 
action, but they accepted the teacher’s definition of the need as is, rather 
than treating the teacher’s expressed need as one piece of information to be 
added to others. They did not confer extensively with other agents in the 
system. 

2. Experienced consultants provided inservice to a variety of roles beyond 
the teacher. For example, 


last night in fact we did a parent presentation.... We’ve got a little routine 
together that we’ve said we are going to go on the road and do our road show. 
We started at the principals’ conference ... and we have done it for our 
superintendents and trustees and our principals and our vice-principals. 


Another experienced consultant reported explaining school system policies in 
workshops offered to professionals outside education (i.e., nurses). 

In contrast, inexperienced consultants provided inservice to teachers 
only. Some of the inexperienced consultants had a great deal of contact with 
principals, but as gatekeepers, not as consultees. For example, they contacted 
or were contacted by principals to negotiate entry into schools. When asked 
who decided the substance of one of her school workshops, an inexperienced 
consultant responded: “[It] was decided by the principal ... I met with the 
principal and we did some planning. He told me the level of awareness the 
people were at.” 

3. Experienced consultants were aware of the variability of teachers’ 
needs and responded by attempting to individualize consultation. Some of 
these individualization strategies involved addressing a range of needs in a 
single workshop. For example, 


differentiating actually in the workshop itself where one person might take 
one group off and do one particular thing and somebody else take another 
group, because we know the needs are different. 


Another strategy involved giving different resources to different individuals; 
for example, 


You’ve got to go back to ... the level of the teaching ability. If I know the 
personalities then I may already know of some things that I should go in with 
and I would do that, either take material in or have sources identified for 
them.... Of course, I always have more than one agenda. 


A related strategy involved a main message with asides for those not ready 
for the main message. 


When we write our curriculum and when we do our presentations, sure we 
are going to try to meet the needs of the people [to bring them to] the highest 
level. But we also want to catch a lot of these people in here [pointing to the 
profile] ... for example, I would say, “OK, if you are using a basal reader and 
you’re using it with round robin groups, or you’re just using workbooks for 
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follow-up, here are two or three other things that you could do that wouldn’t 
mean a lot of work. You could implement them in the next couple of days.” So, 


I guess we are aiming at getting here [pointing], but not overwhelming these 
people who are lower. 


Different types of experiences were used to accommodate teachers with 
different learning styles: lecture, demonstration, sharing, distribution of 
materials, audio and visual presentations—all in the same workshop. 
Separate experiences in separate sessions were also reported in order to meet 
the needs of different groups. For example, an experienced art consultant 
reported that he provided concrete experiences with a detailed procedure for 
teachers fearful of the art curriculum; for talented art teachers he provided a 
less structured approach that extended their interests to other domains or 
deepened their competence in the medium which was of greatest interest to 
them. 

In contrast, inexperienced consultants, with one exception, did not report 
strategies for attempting to meet the varying needs that exist in a diverse 
group of teachers. Only one relatively simple strategy, providing additional 
materials, was reported by an inexperienced consultant: 


You always have such a wide range of levels of awareness and program. I 
guess I’m trying to spark awareness in those people, so I present that. But I 
also want to provide some support for the people that are already doing the 
program; that’s why a lot of the materials [at the workshop are] for the people 
looking for something they can use. 


The difference between experienced and inexperienced consultants is 
partly attributable to the greater sophistication of experienced consultants in 
conceptualizing teacher differences. The experienced had complex levels of 
use profiles containing many dimensions with several performance levels in 
each. For example, “We have a program scale in the County ... it has criteria 
across seven categories of program.” The inexperienced used simpler 
schemes consisting of a few levels on a single dimension. For example, 
“You’ve got a group of teachers that are keeping up [with] the curriculum ... 
and then there is a large group of teachers that aren’t ... there are a group of 
teachers who are not willing to put in any extra time ... they look upon it more 
as a job and not as a career.” 

4, Experienced consultants were more likely, especially in one-to-one 
consultation, to schedule practice in which the teacher would receive some 
kind of feedback on his or her performance. For example, one experienced 
consultant provided teachers with a general procedure for teaching children 
how to plan an artistic composition; he also provided a schedule for introduc- 
ing this procedure to the students: 


I’ll quickly demonstrate how that will work and set up some timelines. The 
first time ... have sort of a chance for motivation, first to get some verbal 
input, to get some ideas going, some mental images. And then you suggest, in 
grade 7, have the child tell you six things that are going to be in the picture, 
and start with the most important ... have them start their own plan ... and 
then that might be the end of Day 1. Day 2, then you start to get rid of the 
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unimportant things in the picture and work towards the centre of interest. 
And the children know exactly where they are going. They have a formula for 
doing it and yet you’re not impinging on the content ... they’re still open and 
free. 


The teacher’s implementation of the assigned activities was observed by the 
consultant and/or principal in subsequent visits to the school. 

The inexperienced consultants were less likely to provide practice with 
feedback in one-to-one consultation although two of the six did so using a 
similar format to that of the experienced consultants. The routine of the 
other inexperienced consultants consisted of a single visit in which the 
consultant modeled the desired behavior or provided materials describing 
what the teacher should do. The extended series of interactions described by 
the experienced consultants (i.e., an objective setting meeting followed by a 
series of shared teaching episodes in which the teacher’s role gradually 
increased) was not visible in their transcripts. 

Some of the experienced consultants also reported that they scheduled 
practice activities in workshop settings as well as in individual consultations. 
For example, “there will be people who need to be brought along and ... if they 
can practice among their peers before they go into the classroom and get a 
feeling of what’s going to happen....” These practice activities were close to 
the classroom behavior to be learned. For example, for a workshop in learn- 
ing how to co-plan with students, the consultants decided 


to bring in students and assign a student to each teacher ... the student had 
picked a topic of interest and together they started to try to co-plan what it 
was they were going to do ... we moved around to try to be, well, facilitators or 
coaches or whatever you want to call it. Trying to listen in on what was going 
on between the teacher and the kid and try to provide some feedback in that 
direction. 


Another experienced consultant reported that she had teachers work with 
the same art materials that the students would be using. Yet another had 
teachers simulate teaching by teaching other teachers or developing lesson 
plans in small groups. 

d. Four of the six experienced consultants reported that they assigned 
practice tasks to be undertaken between consultation sessions. None of the 
inexperienced consultants reported arranging activities that would require 
teachers to consolidate what they had learned in a workshop or in an in- 
dividual consultation by applying the new ideas to their own classrooms. 

Two of these experienced consultants also reported sequencing the prac- 
tice activities in order to reduce demands on teachers as they acquired the 
new teaching skill. For example, one reported that after modeling a whole 
class art lesson, 


realizing that that teacher probably would still have a lot of difficulty in doing 
this with a group of 25 or 30 students, [I] suggested she use a centre approach 
... | was able to first of all, get over the initial hump, have something effective 
happen ... and leave her with something she felt comfortable with. And she 
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could have 4 or 5 working back there and keep a close eye on what they were 
doing. 


The same consultant reported that subsequent interventions with the teach- 
er gradually increased the complexity and the messiness of the art materials 
used. He also recommended that the teacher “try it at home first or after 
school.” The other experienced consultant who sequenced practice reported 
that “I might have shown five steps in my giant workshop, but then I might 
go back grade by grade and break it down and say, OK, well let’s just work on 
moving a little bit into step one or step two with some students.” 

6. Most experienced consultants arranged for teachers to report on their 
attempts to implement new ideas. The reporting activities took a number of 
forms. For example, teachers were scheduled to report to other teachers at an 
in-school or family of schools meeting: “come back to an ASG [i.e., a family of 
schools] meeting for a sharing session. And how has it worked in your school? 
With your equipment?” Teachers were asked to report to principals in self- 
appraisal sessions or the consultant served as a listener: “I brown bagged to 
her school that day and we sat down to lunch.... We talked about what worked 
and what didn’t work.” These reporting sessions were planned in advance; 
the teachers typically knew that they would be called on. These sessions were 
seen as motivational devices: Having teachers display examples of student 
work that were produced by implementing the consultant’s recommenda- 
tions gave teachers a great boost. They also improved the likelihood that 
teachers would put consultant ideas into practice; noncompliance would be 
visible. Furthermore, explaining things to others was seen as a good way to 
consolidate one’s own knowledge. It was also a mechanism for communicat- 
ing to other teachers the benefits of the innovation: They provided tes- 
timonials from satisfied users. 

Most inexperienced consultants did not have teachers report on their 
implementation efforts to others. One who did use the strategy as a way to 
introduce new ideas which may have been unknown to the consultant: “[in] 
many cases you can bring other things in that you want to talk about that you 
may not necessarily know how to approach.” 


Delivery 


7, All experienced consultants assembled teams that involved the consult- 
ant with expert teachers, principals, and other consultants in the delivery of 
consultation. One experienced consultant, for example, reported organizing 
a workshop series with a team of 20 people; responsibilities for delivering 
specific components were shared. Experienced consultants regularly assign- 
ed school support tasks to expert teachers and principals where such resour- 
ces were available. 

In contrast, only two inexperienced consultants reported working in mul- 
tiagent teams. These two worked closely with principals, but at their direc- 
tion rather than collegially. For example, “the principal evaluates what is 
happening within his school and my role is support ... [my role depends] on 
the principal. Strictly on request. I don’t go from school to school saying, my 
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thing is, I can do.” The remainder worked alone or with small teams consist- 
ing of a few expert teachers. 


In this job I expected there would be more team work. Perhaps it’s a function 
of the fact that there are four of us that are brand new this year in the job. We 
all tend to be doing our own thing.... That whole aspect of the teacher as an 
isolated being continues when you are a consultant and I thought it wouldn’t. 
It’s one of the aspects of the job I find least satisfying. 


It may be that this difference between the experienced and inexperienced is 
a reflection of the opportunity that consultants have to develop networks as 
they become known in the role, as much as it is a reflection of a chosen 
strategy. 


Follow-Up and Evaluation 


8. All the experienced consultants reported making substantial adjust- 
ments to their plans on the basis of feedback from teachers. For example, one 
experienced consultant reported that an outside speaker completely flopped, 
so he took over the session and worked with the audience to find ways in 
which the speaker’s materials could be used effectively. The same consultant 
reported that “I wouldn’t go in with a rigid view of what I had to come out 
with. As long as I see some development, some growth, then I’m quite 
happy.” This willingness to change objectives during the delivery phase was 
shared by other experienced consultants. For example, 


You find people asking questions that indicate they are at a much higher level 
than you anticipated. And if you’ve got a good handle on your material you 
can adapt very easily ... the workshop just changed gears because we’ve got a 
pretty good handle on where things are. It wasn’t a problem for us. 


Such flexibility was reflected in the comments of other experienced consult- 
ants who were quick to terminate activities which were not working well or 
to replace materials that were found to be inappropriate. 

Inexperienced consultants reported modest adjustments, if they adjusted 
at all. For example, 


I had a teacher in the room who obviously was further along and you could 
tell by her body language that she was frustrated, that what she wanted from 
the workshop was not what I was providing ... I zeroed right in on her at the 
end of the workshop ... I said perhaps I could come in and talk to you about 
some of the needs that you have in particular. 


Another inexperienced consultant when asked what she does when she en- 
counters an unreceptive audience replied, “you forge ahead anyway.” She 
went on to describe a situation in which she recalled having reservations 
about this strategy but even though “it wasn’t urgent, it was needed to be 
moved on and so we did.” Yet another reported that when facing teachers 
who don’t want to change he suggests, “try it my way ... and then let’s have 
a debriefing at the end and see if you can convince me of your way.... 
Sometimes that works, and in a few cases it didn’t ... my solution in the 
future would be to demonstrate a little more leadership.” 
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9. Consultants differed in the type of data they used to judge the effective- 
ness of consultation. Most experienced consultants talked about observable 
change in teacher behavior. For example, in describing a successful interven- 
tion with two teachers the consultant reported that his subsequent observa- 
tion in their classrooms provided evidence: “I see the things that I’ve taught 
in those classes, I see them again and again, and there was a lasting effect 
and they’ve gone with it and have grown with it.” Others referred to requests 
for materials required by the program, submissions from teachers to newslet- 
ters, requests for additional input, and other methods of determining wheth- 
er the consultant’s advice had been implemented. Experienced consultants 
also talked about consultant effectiveness in terms of measurable changes in 
student outcomes: “student evaluation ... see how it matches the County 
standards.” 

Inexperienced consultants, with one exception, relied on looser and more 
affective categories of data to judge consultant effectiveness. They talked 
about changes in teacher behavior but not in terms of teaching skills: “The 
morale of teachers has changed. Prior to that it was just, excuse the terminol- 
ogy, every time we got together it was bitching sessions.” Others said they 
knew they were effective when enough trust had been established that 
teachers were willing to try something new. “Getting to know the person” 
was the key criterion. Inexperienced consultants also judged their effective- 
ness in terms of the extent to which teachers perceived their offerings as 
useful: “T have to relate to positive affirmations, I think. To know that what 
you are doing is appreciated and is effective.” In recounting an unsuccessful 
event in which there were no “warm fuzzies,” the anguish comes through: 


They jumped down my throat.... They decided to dump on me. I just felt 
really, really unhappy when I left. I felt that I had done a lousy job, I was very 
uptight about the whole thing ... I felt I was blaming myself because I couldn’t 
come up with what they wanted. 


10. Almost all the experienced consultants used the data they obtained on 
the effects of consulting to design subsequent events. They reported that 
“you know how far people have progressed and it gives you things to work on 
for another time” and “it’s an ongoing thing, so you identify materials 
needed, supplies needed, and quite often what will come out of it will be 
strategies needed, then you move into a further sharing session.” One expe- 
rienced consultant described this routine monitoring and replanning as her 
fundamental task: 


I think that’s what really the essence of our job is, being able to identify where 
people are, what their needs are, whether it’s strategies in their classrooms, 
or whether it’s evaluation techniques or whatever it is, identifying their 
needs and being able to respond to them. 


Another consultant contrasted the current tracking routines she used with 
those that used to prevail in her board when consultants “could go out and do 
whatever we jolly well pleased as far as workshops went ... [now] we have to 
write a management plan and report.” 
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In contrast, none of the inexperienced consultants reported using evalua- 
tion data in an ongoing replanning cycle. For some it was not clear how they 
used the data they obtained, if they used it at all. For example, one consultant 
reported “I provide [teachers] with a questionnaire that they fill in collective- 
ly just to give me their ideas ... I may not even use the ideas but what I’m 
doing is collecting the information for me to get as much knowledge as I can.” 
Other consultants had vague plans to use information about the effects of a 
particular activity if the opportunity for the same activity should arise: “I 
think if I do it again next year, I will try to” or “an interesting portion of the 
program that I didn’t use the first time, but did use the second time ... I’d do 
more of that in the future.” More commonly, the inexperienced consultants 
did not use evaluation data for designing new events because they didn’t 
have useful information. 


Models of Consulting 


The differences in strategies used by the two groups of consultants can be 
represented as a pair of models, as shown in Figures 1 and 2. Figure 1 
represents the simple model followed by inexperienced consultants. Consult- 
ation begins with either a request from a teacher (or principal, step la) in 
which case it is processed through the consultant’s perception of the situation 
and the expressed needs of the teacher (step 2), or it comes from the consult- 
ant’s own agenda (step 1b). In neither case is there an extensive search for 
information before setting specific objectives (step 3). The objectives for the 
consultation do not differentiate between groups of teachers. The session, 
typically given by the consultant working alone, consists of pure delivery; 
there is no explicit provision for practice (step 4). Evaluation is limited and 
focuses on the specific event. 


1s Request ee 
ee 


Consultation 
Session 


2 Perception of Consultant 3 obser 
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Figure 1; Simple model of consulting 
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Figure 2 represents the strategic deliberations of the experienced consult- 
ant. The key element in the model is the shaded box representing the reflec- 
tiveness of an actor who consciously plans, experiments, and evaluates in the 
role (step 1). These consultants generate their discrete actions from an 
explicit framework which is informed by a keen sense of long-term goals, 
mechanisms for negotiating specific objectives, knowledge of planned change 
practices, and constant self-appraisal leading to tactical realignment. Prior 
to the initiation of a particular consultation, the experienced consultant 
develops an image of what should be derived from board priorities and 
Ministry guidelines (step 2). The experienced consultant recognizes that the 
specific requests received from various school system agents (step 3a) and 
the consultant’s proactive interventions (step 3b) are each related to the 
curriculum environment of the board. Requests and initiatives are rein- 
terpreted in the light of information obtained from a variety of sources (step 
4). A prime source of needs data is provided by objective indicators of student 
and teacher performance (step 6); these describe the gap between desired and 
actual practice represented by placements on program profiles (step 5). The 
differentiated objectives that emerge focus on narrowly targeted needs of 
individuals and groups (step 7). These objectives are addressed by a consult- 
ation team in a series of sessions that provide for demonstration (step 8) and 
for practice within (step 9) and between sessions (step 10). Each step of the 
process is formatively (in terms of consultee feedback) and summatively (in 
terms of teacher and student changes) appraised and integrated into the 
evolving framework of the reflective consultant (step 1). 


Discussion 


The main finding from the study was that there were differences in the 
strategies used by experienced and inexperienced consultants which seemed 
to be consistently maintained over a series of different contexts—that is, 
different consulting tasks in different boards. Examination of these differen- 
ces suggests that the experienced consultants can be viewed as experts. 
Previous research (e.g., Bereiter & Scardamalia, 1986; Berliner, 1986; Leith- 
wood & Stager, 1987) suggests that experts share certain characteristics that 
distinguish their behavior from that of novices. 

Experts know more: Their knowledge structures are larger and are or- 
ganized in different ways. In terms of amount of knowledge, the experienced 
consultants in our sample knew more about the degree of effort required to 
bring about change and the specific strategies for doing so. In contrast with 
their inexperienced peers, the experienced consultants arranged for a series 
of interactions with individuals and groups. They also assigned practice 
tasks, either within the session or between sessions. These strategies are 
congruent with one of the most strongly supported findings from research on 
the change process: Redundant encounters with the behavior to be learned 
over extended time frames are necessary to bring about lasting teacher 
change. In terms of the way knowledge is organized, there is some evidence 
that experienced consultants represented problems differently than the inex- 
perienced. Experienced consultants defined needs in the context of a system 
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plan, and they had a more sophisticated category scheme for classifying 
teachers. 

Experts have more complex skill structures than novices: They are more 
flexible in their processing; they are able to handle more data; and they are 
more conscious of the effectiveness of their strategies. The experienced 
consultants in the sample demonstrated greater processing flexibility in their 
response to teacher feedback and in their ability to accommodate different 
needs in a single event. Experienced consultants also showed that they were 
able to handle more information in that they had more elaborate ways of 
determining the effects of their interventions and made greater use of such 
data in thinking about their role. 

Experts are simultaneously sensitive to the task demands and to the 
social structure of the job situation; novices are not. The experienced consult- 
ants in the sample demonstrated a sensitivity to the social context through 
their networking and team-building activities. Experienced consultants, to a 
greater extent than the inexperienced, addressed the needs of a broad array 
of school system roles and involved other agents in the delivery and monitor- 
ing of consultation. They also used teachers reporting to others as a techni- 
que for consolidating learning from consultation. 

A second important finding emerging from this study is that the charac- 
teristics that distinguished experienced from inexperienced consultants con- 
cerned the overt strategies used to bring about change. There was little 
evidence that the crude quantitative measures (amount of consulting or time 
spent) identified in previous studies distinguished the behavior of the experi- 
enced from the inexperienced. Similarly, the attention given in the literature 
to the importance of personal attributes was not confirmed. It may be that 
there was little variation across our sample on these personal variables: For 
example, the importance of interpersonal skills was so well known that it 
may have been a necessary condition for selection to the role. The practical 
value of focusing on differences in overt strategies (as opposed to tacit 
knowledge or personal attributes) is that strategies are more easily taught to 
newcomers to the role. 

A third finding concerned the interplay of individual achievement with 
the support/constraint provided by the school system. There was evidence 
that the tactics and executive strategies of the consultants in our sample 
were deeply affected by their specific job descriptions and the institutional 
contexts in which they worked. System plans, networking within teams of 
consultants, and coordinated support between line and staff positions were 
key elements in the strategic planning of the experienced consultants. Be- 
cause all of these elements are largely beyond the control of the individual 
consultant, the potential for superlative performance is greatly diminished in 
unenlightened school systems. But we are more sanguine than Fullan (1981) 
and Fullan, Anderson, and Newton (1986) who argue that the effectiveness 
of consultants is dependent on the emergence of an enabling institutional 
context. Many of the differences between experienced and inexperienced 
consultants we found (e.g., with regard to the provision of practice and 
follow-up) are within the control of individuals. It is more difficult to perform 
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these activities in an unsupportive environment, but it is possible: There is a 
place for the heroic consultant. 

The findings from this investigation are limited in several ways. The data 
were derived from only two boards which were probably significantly more 
advanced in curriculum organization than other school systems in the prov- 
ince. The differences in strategies were identified from self-reports rather 
than from observation. The study did not shadow consultants to measure the 
accuracy of their reports, and we obtained no independent data on the impact 
of their strategies. It may be that the experienced consultants in our sample 
were merely those adept at providing a more elegant apologia for their 
actions. And we do not know what portion of the differences should be 
attributed to selection as opposed to growth in the role. It is also possible that 
some of the differences might reflect differences in ideological support for 
system policies, strengthened when the experienced were reappointed to the 
role, rather than differences in skill or knowledge. 

Despite these caveats, the discovery of consistent differences between 
experienced and inexperienced consultants suggests that the strategies of 
individuals may develop in the role and may be addressed in professional 
development programs. In analyzing the data we noted that there were 
variations within the categories of experienced and inexperienced. We have 
attempted to capture these variations in a profile of consultant strategies 
which is being used to stimulate the reflection and professional growth of 
consultants. The impact of this inservice, and the personal and board charac- 
teristics that contribute to the effects are the focus of our current research 
agenda. 


Notes 


1. This research was funded by the Ontario Ministry of Education, Transfer Grant 87-1056. 
The report reflects the views of the authors and not necessarily those of the Ministry. 
Florence Maynes made a significant contribution during the data collection and data 
analysis phases of the project. Dianne Gates assisted in the interpretation of the 
interview data. A longer report on the study is available in Ross, Regan, and Maynes 
(1988). Andy Hargreaves and Anne Hogaboam provided helpful comments on an earlier 
draft. 

2. Available data banks (Resources in Education, Current Index to Journals in Education, 
and Ontario Educational Research Information Service) were machine searched using 
appropriate key words. Abstracts of hits were used to determine which studies would be 
obtained and examined in detail. Citations in each identified study were pursued through 
manual procedures. 

3. The research reported here focused on internal consultants—those employed by a school 
district—rather than external consultants employed by an outside agency (e.g., a 
university) to bring about change. Although there have been few attempts to compare the 
roles (e.g., Fullan, 1981; Louis, 1981), the similarities seem to be greater than the 
differences. 
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This article summarizes a case study of students in a two-year college transfer 
program as they reached the point of transition to university. As well as describing 
their characteristics and perceptions, the study sought to distinguish between 
persisters and nonpersisters and to seek out factors associated with attrition. Data 
were gathered using two questionnaires: one administered to Red Deer College 
transfer students in March of their second year, and a follow-up in the fall of the 
same year. Only 10% of respondents did not persist in their studies, and they were 
indistinguishable from the persisters, except in having recognized the possibility 
of “stepping out” as an option. Both persisters and nonpersisters were satisfied 
with their college experience and felt that they had achieved their academic goals. 
The study confirms the value of transfer programs and reveals suggestions for 
dealing with problems encountered in transferring from college to university. 
Furthermore, the study provides a context for exploring policy questions relevant 
to access to degree programs in Alberta. 


Most of Canada’s two-year colleges were created during the 1960s and 1970s 
to provide enhanced postsecondary opportunities for those who were not 
inclined to attend university. In British Columbia and Alberta, however, the 
mandate of the new colleges included first- and second-year university trans- 
fer programs. As these colleges have evolved, transfer has become firmly 
institutionalized: No college has given up this function and others have 
sought to expand it. The centrality of the transfer program has led to 
renewed interest in degree-granting status among certain colleges in both 
provinces (see, for example, “Increasing Access,” 1988; Lau, 1988; Strick- 
land, 1988). 

Although decisions to extend degree-level studies are mainly political in 
nature, educational factors such as the capabilities of the colleges, their 
record of success, and the perceptions of students must also be taken into 
account. For instance, what percentage of transfer students actually proceed 
to university? How well do they do when they get there? The literature from 
the United States has been generally positive on the achievement of transfer 
students when they reach university (see, for example, Cole, 1986; Holahan, 
1983; Rodriguez, 1987). Although there are fewer Canadian studies, their 
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findings have been equally affirmative (see, for example, Dennison, 1982; 
Small & Konrad, 1986; Vaala, 1988). These studies show as well that stu- 
dents generally feel positive about transfer and are satisfied with the college 
program as a preparation for senior-level studies. While students may have to 
go through an adjustment phase, this is usually completed during their first 
year at university. Grades may dip but tend to return to about pretransfer 
levels. 

This article is concerned with the more troublesome question of how 
many students in transfer programs actually proceed to university: What 
factors are associated with decisions to continue or to withdraw? And how 
many would have preferred to remain at the college to complete their de- 
grees? A 1988 case study of students at Red Deer College (Tyler, 1988) 
provides some answers. 

The record has not been favorable concerning rates of transfer from 
college to university, in some instances reaching a low of 10% (see, for 
example, Knoell, 1982; Sheldon, 1982). Student factors associated with 
attrition include financial problems, job commitments, housing problems, 
poor grades, personal difficulties, family responsibilities, lack of clear goals, 
and studying part time (Astin, 1975; Knoell, 1982; Sheldon, 1982). Converse- 
ly, factors associated with high persistence include full-time status, tradition- 
al college age, moderate work commitments, high academic expectations 
(Cohen, Brawer, & Bensimon, 1985), informal interaction with faculty (Gil- 
bert & Gomme, 1986; Lunneborg & Lunneborg, 1986), and supportive in- 
stitutional environment (Boshier, 1971; Cope & Hannah, 1975; Pascarella, 
1986). 

There have been few Canadian studies of attrition, but the experience in 
British Columbia suggests that the problem may be just as serious as in the 
United States (British Columbia Ministry of Advanced Education and Job 
Training, 1987). Attrition is detrimental to the institution because it repre- 
sents a loss of income and a wastage in training and may also be a sign of 
institutional failure. For the student, attrition may be personally traumatic 
causing frustration, loss of self-esteem, shattered career plans, and lowered 
lifetime earnings. 

Students may be most attrition-prone at the point of transfer when they 
face problems associated with moving, in addition to any continuing aca- 
demic difficulties. For some the temptation to withdraw is too great to 
overcome. 

The case study reported here was conducted at Red Deer College which 
was created in 1964 as a comprehensive college with a strong orientation 
toward university studies. Students are drawn mainly from the city of Red 
Deer which has a population of about 60,000, but the catchment area extends 
some 50 km including an additional 100,000. Most of the 6,000 full-time and 
part-time students commute to college, but there is residential accommoda- 
tion for over 500. 

Although the college was designed to serve students of all ages, more than 
half the 2,000 transfer program students enroll within a year of completing 
high school. These students take courses in arts, sciences, education, com- 
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merce, fine arts, music, physical education, and a range of preprofessional 
programs. Course credits may be transferred at any time although typically 
students transfer after two years to one of Alberta’s three traditional univer- 
sities. About 65% transfer to the University of Alberta in Edmonton 150 km 
to the north, and about 25% to the University of Calgary an equal distance to 
the south. Only 3% transfer to the University of Lethbridge 400 km to the 
south. 

The principal purpose of the study was to explore attrition-related factors 
as students experienced the transition from college to university. More spe- 
cifically it set out (a) to examine the characteristics of Red Deer College 
transfer students near the end of their transfer program, (b) to determine the 
attrition rate and related student perceptions, and (c) to identify any dif- 
ferences between persisters and nonpersisters. 


Method 
Students registered in the second year of the transfer program were mailed a 
questionnaire in March 1987. A response of 160 (65.5%) was obtained. (A 
possible explanation for the lower-than-anticipated response rate was the 
administration of the questionnaire around the time of final examinations.) 
Information which prior research had identified as influencing persistence 
was gathered. 

A second questionnaire was mailed to respondents in October of the same 
year producing a response rate of 97%. Persisters were identified as those 
who had enrolled in a university or were continuing their studies elsewhere. 
The questionnaire elicited perceptions of their transfer experiences. In the 
case of nonpersisters the questionnaire sought reasons for discontinuance 
and indications of future study intentions. 

Findings 

Background Characteristics 

The transfer students can be described as a traditional, homogeneous stu- 
dent population. Most were young (62% under 21), single (79%), full time 
(93%), and were evenly divided by sex (53% female). Although most had some 
financial concerns, relatively few (34%) were working (average of 13 hours 
per week). Only 23% could be considered nontraditional in the sense of 
representing an older population. In this respect Red Deer College is like 
most other public colleges in Alberta (Vaala, 1988, p. 67). 


Academic Integration 
Respondents appeared to be well integrated academically, 44% having good 
to excellent grade point averages. Their modal GPA on a 9-point scale was in 
the range 6-7. They were generally satisfied with their academic perfor- 
mance, 18% rating their achievement “better than expected” and 41% 
“about as expected.” To accomplish this students spent an average of 35 to 
40 hours per week on academic activities divided equally between classroom 
and private study. 

Other indicators of integration were evident: Respondents expressed sat- 
isfaction with informal contact with faculty (mean of 3.4 hours per week 
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from an overall 16.7 hours in college-related activities of a social nature). In 
keeping with these findings 73% claimed that friendship support was one of 
the main advantages of starting postsecondary studies at a college. 


Intentions and Goals 


The respondents appeared to have positive attitudes about their academic 
futures. Ninety-two percent were planning to continue their studies in the 
fall; 65% expected to complete their first degree in two years; and 86% 
anticipated working toward an advanced degree. Few foresaw any possibility 
of dropping out, and most (68%) believed there was little likelihood of failing 
any courses after transfer. About half believed they could achieve an 8 or 9 
GPA while at university. Respondents were equally committed to their career 
goals. 


Environmental Factors 


Respondents were asked to rate, on a 4-point scale, 42 factors relating to 
their perceptions of the college environment. They perceived most factors to 
be important or very important. The eight highest in importance were all 
related to instruction. The ratings of actual environment were similarly high 
or very high. Quality of instruction (mean 3.0) was rated slightly higher than 
factors related to peers (2.8), physical aspects of the campus (2.8), and 
support services (2.8). These positive attitudes suggested that there was a 
good fit of students to college. Eighty percent would have preferred to remain 
at the College for degree completion had that option been open to them. 


Persistence and Nonpersistence 


When the above descriptions are examined in the light of the literature on 
persistence, they suggest the likelihood of a high level of persistence for the 
respondent group. Such was indeed the case. Only 16 students discontinued 
their studies at the end of the second year, while 109 transferred to university 
and 35 reregistered at the College. 

There were few distinguishing features in the nonpersister group. Their 
backgrounds and academic characteristics were similar to those of the per- 
sisters, but they showed less confidence in their academic ability and pros- 
pects: Nonpersisters had higher expectations of temporarily interrupting 
their studies (stepping out) and expected to take longer to complete their 
degree programs. Most were considering discontinuing at the time of the 
March survey but were planning to resume within three years. Based on 
group means, nonpersisters tended to rate goal statements slightly lower 
than did persisters which suggests that they may be less goal-oriented. 
However, statistically significant differences were found in only 4 of the 22 
goal comparisons. 

The highest ranked reasons for nonpersistence were “need for a break in 
studies,” “obtained a desirable job,” and “lack of financial support.” Half the 
reasons given for withdrawal may be categorized as circumstantial (e.g., 
financial difficulties, draw of an attractive job, serious health problems, or 
nonacceptance by the university). Other reasons were attitudinal (e.g., need 
to rethink career/life plans, or being tired of studying). For those experi- 
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encing attitudinal difficulties, their time in college may have been a period of 
self-exploration and psychological development. Withdrawal from studies 
may have been a considered, appropriate, and positive decision. It is doubtful 
that any action on the part of the College could have or should have prevent- 
ed these withdrawals. However, the circumstantial difficulties are conditions 
which the College should attempt to ameliorate. 


Transfer Difficulties Encountered by Persisters 


In the March survey, responses indicated that transfer students were an- 
ticipating high to very high levels of difficulty in the following: coping with 
the increased costs associated with university, getting to know the faculty, 
obtaining recognition, accessing services, securing appropriate accommoda- 
tion, selecting courses, coping with academic requirements, getting to know 
their way around campus and the city, and making new friends. However, by 
October persisters ranked only three factors above the midpoint of the dif- 
ficulty scale: coping financially, obtaining recognition, and getting to know 
faculty. Clearly the difficulties in adjusting to university were not as exten- 
sive as feared. 

Although credit transfer was not rated high as a problem area, 22 of the 
persisters indicated that they experienced some difficulties in receiving credit 
for courses completed at the College. These difficulties were attributed to 
system problems connected with the university registration process, or to 
inadequate counseling while at college. 


Discussion 

Generalizing from a single case study is obviously risky. However, this study 
offers a glimpse of transfer students in an Alberta community college and 
provides further evidence of the value of colleges as transfer institutions. 
Two basic issues are implicit in this study. Is the transfer program serving 
the purpose for which it was designed? And are students satisfied with 
starting their postsecondary studies in this way? The answer to both ques- 
tions appears to be an unequivocal yes. 

There were surprisingly few nonpersisting students in the transfer 
program. Ninety percent of the students continued their studies, one way or 
another, at the end of their second year. Both persisting and nonpersisting 
students appeared to have been academically and socially well integrated in 
the College, and they were satisfied with their college experiences. Few 
significant differences were found between persisters and nonpersisters, 
except that the latter viewed themselves as noncontinuous students and 
expected to take longer to achieve their degree goals. 

No causal relationships can be determined with certainty, but it is possible 
that the traditional nature of this College’s clientele, their apparent satisfac- 
tion with the academic environment, and the high ranking they gave to 
academic goals all contributed to persistence. Although 86% had some finan- 
cial concerns, this did not appear to be a major factor in a decision to continue 
or withdraw. This is consistent with findings reported by Pascarella, Smart, 
and Ethington (1986) who have shown that financial difficulties are less of a 
deterrent when a student’s other experiences are positive. 
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While the articulation process between Red Deer College and provincial 
universities appears to be functioning well, students felt some apprehension 
about it. They anticipated more difficulty in adjusting to university than 
they actually encountered. Respondents’ comments revealed that many 
viewed their entrance to university with trepidation, using such words as - 
“overwhelming,” “intimidating,” “a shock,” “cold and impersonal,” “the 
student is just a number,” and “too large and confusing.” As there is no good 
reason to enlarge student fears, college counselors may wish to consider ways 
of reducing apprehension by developing more realistic images and by helping 
the students accept the transition process as a challenging but potentially 
enjoyable experience. Reference to the findings reported by Vaala and Hol- 
daway (1989) concerning the relatively short adjustment period may be 
relevant. Colleges and universities should consider joint sponsorship of an 
annual orientation program to help transferring students develop positive 
images of the next phase of their studies. 

The Alberta Council on Admissions and Transfers was established in 1974 
to develop procedures to facilitate transfer. Through publication annually of 
a transfer guide, it has made huge strides in removing uncertainties about 
which courses will transfer and which will not. Despite this fact, 21% of 
students transferring from Red Deer College experienced difficulties in re- 
ceiving all the credits to which they felt entitled. Thus there is a need for 
colleges to ensure appropriate counseling and course selection, to monitor the 
transfer process carefully, and to serve as advocates for students experi- 
encing difficulties not of their own making. 

Generally, the findings attest to the workability of transfer and may be 
significant for policymakers facing pressures to expand access to degree-level 
studies through the college system. Also, the study may be of interest to 
provinces such as Ontario and Manitoba which have no formal transfer 
programs, but where debate on such developments is ongoing (see, for ex- 
ample, Tausig, 1988). 

The question of degree-granting status is illuminated by this study. In 
response to a direct question, 80% of students at Red Deer College indicated 
that they would have preferred to complete their degree programs at the 
College. The case for completing degree programs in their local college 
includes the preservation of ties and commitments involving family life, 
work, and sense of community, as well as feelings of pride, confidence, and 
satisfaction in the college which had guided them into postsecondary educa- 
tion. At the same time, respondents noted several academic advantages of 
moving to a university. The number of students who would actually remain 
at the College for degree completion, if this were possible, would depend on 
many more factors than a simple desire to stay near home. 

Finally, questions of equity arise for nonpersisters who may have achieved 
personal goals while in the transfer program, despite their unwillingness or 
inability to proceed to university directly. They should be recognized for what 
they have accomplished rather than disparaged for what might have been. 
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Conclusion 

This descriptive case study of transfer students as they enter and experience 
the transition between college and university presents a favorable image of 
transfer as it related to one college. To the extent that Red Deer College is 
typical of other transfer institutions such as Medicine Hat College, Grande 
Prairie Regional College, and Keyano College, it may be concluded that the 
system is working well. The attrition rate was less than 10%, and students 
embarked on their studies at university with positive sentiments regarding 
their college experience and bright hopes for their futures. Even those who 
did not persist expressed satisfaction with the college and anticipated re- 
entering formal studies some time in the future. Such students should not 
automatically be regarded as failures of the system. The reasons for attrition 
seem more tied to circumstances than to achievements in college. The con- 
cept of failure in the context of a transfer program may have to be revised to 
allow for the possibility of achievement of personal goals aside from transfer 
and the possibility of deferred entry to senior-level studies. 
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THE EDITOR’S PAGE 
Where is it Leading? 


The lead article in this issue of AJER is the report by Professor Thomas 
Kieren to the University of Alberta’s Faculty of Education on his McCalla 
Research Professorship. Each year, one or two faculty members are awarded 
a special opportunity to spend an academic year in pursuit of their research. 
Usually the recipients spend the time collecting their research into a more 
unified whole. In this issue Kieren reports on two decades of research on 
mathematics learning. In the next issue, the address presented by Professor 
Lorraine Wilgosh on her work with families of handicapped children will 
appear. Both of these papers are distillations of research experience and are 
the kinds of reflections that should be the ultimate result of research en- 
deavors. 

As is generally the case, this issue of AJER also contains a collection of 
papers on a variety of topics. The articles by Cole and Housego provide an 
excellent illustration of how different approaches bring out complementary 
sorts of information. Their research on that two- or three-year segment of 
teacher development falling between the final steps of formal preparation 
and the end of the first year of teaching is part of a very active area. In fact, 
if we were to categorize submissions to AJER, this kind of work would form 
the largest group. However, there is much more to be done, and, to restate a 
theme of previous editorials, no single method can do justice to this complex 
period of a beginning teacher’s life. Surveys give a skeleton; case studies 
provide soul. 

The articles by Marfo and by Meyer, Wilgosh, and Mueller are two varia- 
tions of quantitative methods—laboratory and archival. Although they are on 
quite different topics, both speak to problems in the education of young 
children and have implications for teachers and administrators alike. Finally, 
Johnson and Holdaway’s article gives us some insights into the lives of 
principals. In a sense we have moved full circle. The study of the educational 
development of principals in contemporary society requires the same kind of 
attention that is being devoted to teacher development. 

As important as all of these articles are, we are now at the stage where we 
need some integrative pieces. I am confident that few policy makers, teacher 
educators, and educational administrators work from discrete articles. The 
constraints of their positions demand information in a partially digested 
form. Unfortunately, the theory-into-practice paper has not been a priority in 
the academic educational research scene. Such work is more commonly 
found in the professional magazines and in the “pop-ed” books written by the 
evangelists of the lecture circuit. Researchers must rethink their roles. In the 
academic journals, reviews have tended to be either annotated bibliographies 
or meta-analyses. These have a place, but neither serves the field completely. 
It is time for researchers to undertake critical, analytical, and integrative 
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reviews to reveal the trends in a field. These state-of-the-art reviews must be 
located somewhere between theoretical accumulations and popular promo- 
tions. They must tell educators what directions can be safely taken and which 
ideas should be treated with caution. They must integrate the empirical 
research with the philosophical literature so that the emerging picture fea- 
tures what is possible in a context of what is desirable. The review process of 
the academic journal serves the field by subjecting the writer’s ideas to the 
scrutiny of the research community. This yields advice in which substance is 


given preference to flash. 
T.O.M. 
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THOMAS E. KIEREN 


University of Alberta 


Understanding for Teaching 
for Understanding’ 


Anything said is said by an observer. 

The observer speaks to another observer 

who could be himself. 

The observer is a human being. 

Whatever applies to living systems applies to the observer. 
(Maturana & Varela, 1980, p. 8) 


This presentation is based on reflections of a year spent as an observer. The 
things I talk about today are in the domain of the observer. These observa- 
tions relate to a continuing concern of mine expressed in a paraphrase of a 
quote from Warren McCullough (Kieren, 1988): “What are persons that they 
can know number? What are numbers that they can be personally known (p. 
162)?” The observations I am making today pertain to the first of these 
concerns—the person as an understander of number and mathematics in 
general. 

In highlighting these observations, let me turn immediately to the title 
and let me stretch it out a bit: “Understanding for Teaching for Under- 
standing for Teaching for Understanding for Teaching for Understanding.” 

This extended title is meant to illustrate several things. First, the theoret- 
ical ideas, the observations, and the curriculum materials discussed below 
provide various ways of enhancing cognitively guided instruction. This no- 
tion, developed by researchers in the United States such as Penelope Peter- 
son at Michigan State and my colleagues Tom Carpenter and Liz Fennema at 
Wisconsin (Fennema, Carpenter, & Peterson, 1986), suggests that if teachers 
could have in mind how children cognize mathematics, such insights could 
guide their instruction. As Dean Bob Patterson would say this idea harks 
back to the work of John Dewey. Or, put in its strongest modern form by the 
constructivist thinkers, one gets a favorite quote of mine from Les Steffe 
(1988) of the University of Georgia: “The child constructs the mathematics. 
The teacher constructs the child.” 

In the extended title we see many rounds: Understanding a child’s under- 
standing brings about teaching to build that understanding. The extended 
title suggests that this pattern repeats itself. One might even argue that it 
occurs recursively; prior understandings, whether of the child or teacher, are 
a basis for new understanding. 

The extended title also appears to imply that this process occurs linearly 
and over time. It might suggest that the understanding at the beginning was 


Thomas Kieren is a professor of secondary education. His special interests are in the nature 
and growth of personal mathematical knowledge and the role of teaching in promoting such 
growth. 
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distant and distinct from the understanding at the temporally later end. This 
might be the visible or explicate order of things. The observations I want to 
discuss with you today are of different implicate and enfolded and unfolding 
orders of things. Such orders are nonlinear in nature. 

Notice when one folds this title the layers of understanding are suddenly 
seen as close and connected. This would be even more obvious if I had rolled 
up the understandings. Then the hole in the middle could be seen as the site 
of still earlier understandings and so on. 

What I would like to do in the remainder of this brief essay is to give a 
flavor of the kinds of observed orders I was studying and how in fact we are 
trying to make sense of them. I will do four things: 

- Provide a basis in children’s mathematical behavior which suggests 
that understanding understanding itself is worthwhile for a researcher 
or teacher; 

- Present a nonlinear model of understanding as it is done by a person 
which has been built in collaboration with Dr. Susan Pirie of Warwick 
and Oxford Universities; 

- Present a few properties of the model with examples; and 

- Present some consequences for instruction and curriculum. 


Understanding Understanding—Why and How 
Consider the work of a grade 7 boy, Ronald. Ronald was part of a class which 
had received careful instruction in the arithmetic of fractions, especially 
adding, multiplying, and dividing. 


Bas Jy while 3 
Ee U aa 
bes tes eae 
We ail 


This work shows a classic error in adding fractions which evaluation data 
would suggest occurs in the work of junior high school children in Canada, 
the United States, or in many countries of the world including the United 
Kingdom and Israel. 

Here is Ronald’s work on a third item. This involved him in making a 
judgment on a quasiconcrete situation. 


“An order came in for - + as pizzas. The cook said, “That will total 


more than one pizza.” 
Was the cook right? No. 


Why? Because se a = a and that is much less than 1 whole pizza. 


At this point the teacher (in this case me, because Ronald was part of a 
teaching experiment which I was conducting) might be prompted to say, 
“Ronald has received meaningful instruction, he’s thinking, but he just 
doesn’t understand addition.” 


But let’s now turn to Ronald’s work on the very next item. 


2 
George ate 3 of a medium pizza. Ellen ate i of a medium pizza. 


How much more pizza did George eat than Ellen did? 
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Ronald, like the other students, had received only incidental instruction on 
subtraction. 


23 al 
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In this case, where Ronald did not have a ready algorithm, his under- 
standing of this situation and clearly of addition is easily observable. His 
work shows an image oriented algorithm paralleled by an action based on 
symbolic properties. Work like Ronald’s prompts one to ask, “What might it 
mean to understand mathematics [or anything]?” and “What might it mean 
to teach for such understanding?” 

There has been considerable work in mathematics education over the past 
20 years on understanding (Skemp, 1987; Schroder, 1987). One approach has 
been to develop categories of understanding such as instrumental and rela- 
tional understanding. By the former is meant that a child can do something 
but does not know why; by the latter is meant that the child can justify a 
performance in terms appropriate to the experience of the child. Other work 
has elaborated other categories of understanding related to the tools, materi- 
als, or experiences used in the process of doing the mathematics—for ex- 
ample, concrete, iconic, or symbolic understanding. Still other work (Tall, 
1978) suggested a combination of categories as characterizing understand- 
ing. This latter sounds practical in the sense of a researcher or teacher having 


Concrete Iconic Symbolic 
Relational 
Instrumental 
Logical 
None of the above 


a two dimensional check sheet. 

It may be that such categories or vectors or combinations of categories 
might well describe specific mathematical behaviors or outcomes of child- 
ren’s work such as each of those of Ronald above. Yet, as claimed by Pirie 
(1988), they do not well capture understanding as seen in the continuing 
mathematical work of students. She posits that such understanding is not 
well portrayed by either categories or linear combinations of categories. 

With Dr. Pirie, I have been developing a theory of mathematical under- 
standing as a whole, dynamic, nonlinear, recursive growing process (Kieren 
& Pirie, in press). As Minsky (1989) has recently suggested, unlike a ma- 
chine, humans can know something or understand it at many levels at once. 
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That this is not a simple platitude is evident as one carefully observes 
children learning and doing mathematics and is particularly visible in this 
single piece of work by Jim, aged 12, in response to this question: 

Who got more, person A who gets a fair share of pizza when 2 pizzas are 
shared among 3 persons or B who gets a fair share when 5 pizzas are 

shared among 8 persons? 

Jim’s response: 


A gets more. 


| eal Hae | | ee er eas Le: Br Bk a hat ile ah 
Because 3 of pis gand5 + G=G+G=G=3:2+8-381T 8-8 


; is more than 2 is bigger than é because there are fewer pieces. 


This item response nicely shows Jim’s understanding the arithmetic of frac- 
tions in many ways at once. For this to occur, these understandings must be 
close for the person having them. We are trying to build a theory which 
captures this phenomenon. 

What does a person’s mathematical understanding (MU) “look like”? I 
suppose one could picture it as a growing amorphous phenomenon (see next 
page). 

But we are trying to theorize and observe features of such understanding 
which go beyond such a vague image. In trying to characterize mathematical 
understanding, we have been informed from many sources. Our theory rep- 
resents an attempt to elaborate a constructivist definition of understanding 
provided by von Glasersfeld (1987): 


The experiencing organism now turns into a builder of cognitive structures, 
intended to solve such problems as the organism perceives or conceives ... 
among which is the never ending problem of consistent organizations [of 
such structures] that we call understanding. (p. 7) 


In trying to provide the understanding for teaching for understanding, we 
are trying to model the child, the student, or the person as an understanding 
agent. We picture the students we observe as self-referencing, self-maintain- 
ing persons who can access their own thinking as part of their learning 
environment. What they do in any situation is not caused by this situation; it 
is determined by their own structure which is continually growing and 
changing. This view is based on the work of Maturana and Varela of Chile 
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Reflecting 
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(1980, 1987) and Tomm (1989) of the University of Calgary who see knowing 
as the ability to take effective action as seen by an observer. Such knowing 
grows through recursion. That is, the person knows at a new level through 
making distinctions in previous knowing. 

As we try to give a clearer characterization of the pulsating growing whole 
which is mathematical understanding, what does our theory look like? We 
consider understanding done by a person as a leveled phenomenon with eight 
kinds of levels (see previous page). But you should not think that the levels 
constitute understanding. It is a nonlinear dynamic whole characteristic of 
each person. 

It is beyond the scope of this essay to describe the model above in any 
detail. But I would like to point out a couple of features of the theory and then 
briefly discuss a few implications of the theory for teaching and curriculum 
making. 

A central feature of personal understanding of anything is the capability 
to fold back. A brief description of this key process which we have observed in 
students’ mathematical work will give you a flavor of the theory. 

Imagine a grade 7 student, Tracy, who was started with primitive or 
assumed doing capabilities such as dividing up equally, knowledge of fraction 
words, and so forth. Through solving dividing up or quotient problems, she 
has developed an image of fractions as describing amounts and, in fact, she 


sees that different fractions, for example, $ and . describe the same 


amount. She has elaborated this image by understanding addition and multi- 
plication as properties of this quantitative image of equivalence and now sees 
fractional numbers not as individuals but as a formalized class—numbers of 


the form - Our theory suggests that her growth of understanding of frac- 


tions to date would be a nonlinear path among the levels from primitive doing 
to formalizing (Pirie & Kieren, 1990). 

Now the teacher asks a question which invokes inner level understanding 
for Tracy: 


Teacher: Is there a fraction bigger than = smaller than ae 


Tracy: 2??? consternation reigns! 
But then 
Tracy: (folding back) Well, 


Half plus half of a fourth is more than a half and less than three fourths. 
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[This is folding back to her image of fractions from image making and image 
having.] 

At this point Tracy who has the property equivalence in her under- 
standing of fractions could note: 


and 2 is half way between 


She could then reconstruct her fraction knowledge at a property level or 
could make an intuitive leap back out to the formal level and say, “Oh the 


a 1» 


average of 5 and = will work!” [a formal observation]. In fact we are 


speculating that folding back might partly account for the notion of intuitive 
leaps reported in mathematics and other areas. Thus from a level of for- 
malization Tracy folded back to an image making or image having level to 
make sense out of the formal level puzzle. After that she used equivalence at 
a property noticing level. The impact of such work was to reconstruct her 
understanding of fractions to add in this new idea of betweenness. Folding 
back was not simply recall. There was nothing in her history to directly 
answer the question. Tracy had to remember her knowing and to reconstruct 
her inner level knowing as a foundation for outer level, formal knowing. 

A second key feature of this model is that there is a complementarity of a 
process and a form oriented action. As seen in the model this complemen- 
tarity occurs at every level. I would like to look for a moment at one ring of 
this model (see next page). 

This is the level of property noticing which is complemented by the formal 
aspect of property recording. Under support from Alberta Education I have 
been building and testing some materials which allow high school students to 
use the computer as a tool to build geometric ideas. In one of the units they 
are asked to build from scratch their own geometry of a set of geometric 
objects called spirolaterals. As one of the tests of the material and as an 
investigation into the theory above, I had three pairs of university students, 
all with good mathematics backgrounds, engage in the study of spirolaterals. 
The materials use the above model of understanding and promote growth 
from image making to image having to property noticing to formalizing to 
observing to proving in a structure. Two of the groups very quickly had the 
image of spirolaterals as inwardly spiralling geometric objects. If you had 
been there, or had observed the videotaped record of their computer work, 
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you would have seen these four students almost madly generating or noticing 
properties. But they did not record them in any way physically or in writing. 
I suspect they didn’t even make many mental notes of what they were doing. 
It was as if even these advanced students sensed the existence of properties 
but did not have them. In terms of our theory we point out that one must take 
the form-oriented action of property recording before one can be said to be 
fully functioning at the property noticing level. Otherwise one’s under- 
standing is uncompleted at that level. More critically for practice, if little 
attention is paid to the form of recording (at least mentally and orally), then 
it might appear to the teacher that the students had not learned anything 
from the instruction in that session. The property recording is also essential 
to the student. The student must not only do something, but be conscious of 


each act. The person must not only act in an understanding manner but must 
give this action some form. 


Curriculum Implications 


I would like now to turn briefly to some curriculum implications of the model. 
In doing a teaching experiment on fractions, each day I built experiences for 
students based on their growing understanding. In some sense the cur- 
riculum was wrapped around their previous knowing. Further, I was struck 
with the continuing problem I experienced of attempting to invoke inner 
levels of understanding to help students grow in understanding. 

To see other implications I would like to call your attention to another 
feature of the model—primitive doing. This inner core represents simply 
what the observer assumes the person can do as we start to observe his or her 
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Curriculum Implications 


I would like now to turn briefly to some curriculum implications of the model. 
In doing a teaching experiment on fractions, each day I built experiences for 
students based on their growing understanding. In some sense the cur- 
riculum was wrapped around their previous knowing. Further, I was struck 
with the continuing problem I experienced of attempting to invoke inner 
levels of understanding to help students grow in understanding. 

To see other implications I would like to call your attention to another 
feature of the model—primitive doing. This inner core represents simply 
what the observer assumes the person can do as we start to observe his or her 
understanding. This gives the model a fractal quality, because anything can 
be assumed to be previous doing. This allows us to consider the under- 
standing of fractions as quotients as developed in grade 7 as primitive doing 
for a whole new set of levels of understanding of fractions as algebraic 
operators. Thus the curriculum, like understanding as done by a person, can 
be thought to have a fractal form (see previous page). 

Furthermore, as one looks at the circle of primitive doing carefully, one 
would also observe the same levels in the understanding there. Thus the 
model suggests that one might research the core understanding of the “half 


fractions”: 5 ‘ - c .. and see if the same model of understanding with 


levels and folding back did not obtain. I am currently working with 8-year- 
olds to observe this phenomenon. 

Let me close by asking what good is this theory? With Einstein (Fine, 
1986), I look at the test of it as follows: 


[a theory is] a self sharpening tool whose warrants and value in the end rest 
on this, that they permit the coordination of experience “with dividends” 
[mit vorteil]. (p. 91) 


What are the dividends? 
These are still speculations but the theory allows one to see: 


- Understanding as entailing folding back for the reconstruction of inner 
level knowing. 


- All levels of understanding as useful and critical. Each has its own 
actions, forms, and logic which should be considered in instruction. 
- Understanding is seen at many levels at once. 
- Curriculum making is a wrapping around actively. 
- Curriculum as well as understanding for the person can be observed to 
have a fractal quality. 
- Teaching is pointing the child ahead to possible new levels, but also 
back to reconstruct inner level understanding. 
- Understanding is an active quality of the person. Teaching for under- 
| standing cannot mean giving understanding. 
Finally, folding you back to my example at the beginning of this discus- 
sion, the theory allows us to see children’s understanding, such as Ronald’s, 
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not simply as a succession of disjointed products however well classified, but 
as a dynamic growing whole. 

Let me finish with a favorite quote from Maturana (Maturana & Varela, 
1980): “At this point there is much more to say, or nothing” (p. xxx). 

I hope I have left you with the feeling that there is much more to say and 
study about understanding for teaching for understanding. 


Note 


1. A 1990 McCalla lecture, Faculty of Education, University of Alberta. 
2. Research underlying this paper was supported by a SARAC Grant of the Faculty of 
Education, University of Alberta and a British Council Travel Grant. 


References 


Fennema, E., Carpenter, T.P., & Peterson, P.I. (1986). Studies of the applications of cognitive 
and instructional sciences to mathematics instruction. Madison, WI: Wisconsin Center for 
Educational Research, University of Wisconsin. 

Fine, A. (1986). The shakey game—Einstein realism and the quantum theory. Chicago: 
University of Chicago Press. 

Kieren, T.E. (1988). Personal knowledge of rational numbers: Its instruction and formal 
development. In J. Hiebert & M. Behr (Eds.), Number concepts and operations in the 
middle grades. Hillsdale, NJ: Erlbaum. 

Kieren, T.E., & Pirie, S.E.B. (in press). Recursion and the mathematical experience. In L. 
Steffe (Ed.), The epistemology of the mathematical experience. Berlin: Springer Verlag 
Psychology Series. 

Kieren, T.E. (in collaboration with S.E.B. Pirie). (in press). Rational numbers: From quotient 
fields to recursive understanding. In T. Carpenter & L. Fennema (Eds.), Cognitively 
guided instruction for rational numbers. Madison, WI: Wisconsin Center for Educational 
Research, University of Wisconsin. 

Maturana, H.R., & Varela, F.J. (1980). Awtopoeisis and cognition. Boston University, 
Philosophy of Science Series, Vol. 42, Dordrecht, D. Riedel. 

Maturana, H.R., & Varela, F.J. (1987). The tree of knowledge. Boston: The New Sciences 
Library, Shambhala. 

Minsky, M. (1989, October). Artificial intelligence. Discover, pp. 12-14. 

Pirie, S.E.B. (1988). Understanding—Instrumental, relational, formal, intuitive ..., How can 
we know. For the Learning of Mathematics, 8(3), 2-6. 

Pirie, S.E.B., & Kieren, T.E. (1989). A recursive theory of mathematical understanding. For 
the Learning of Mathematics, 9(3), 7-11. 

Pirie, S.E.B., & Kieren, T.E. (1990, April). A recursive theory for mathematical 
understanding: Elements and implications. Paper presented at the annual meeting of the 
AERA, Boston. 

Schroder, T.L. (1987). Students’ understanding of mathematics: A review and synthesis of 
some recent research. In J. Bergeron, N. Herscovics, & C. Kieran (Eds.), Psychology of 
mathematics education XI, Vol. 3. Montreal: PME. 

Skemp, R.R. (1987). The psychology of learning mathematics (expanded American ed.). 
Hillsdale, NJ: Erlbaum. 

Steffe, L. (1988). Children’s mathematics. A presentation to the International Congress on 
Mathematics Education. Budapest, Hungary. 

Tall, D. (1978). The dynamics of understanding mathematics. Mathematics teaching, 84, 
50-52. 

Tomm, K. (1989). Consciousness and intentionality in the work of Humberto Maturana. A 
presentation for the Faculty of Education, University of Alberta. 

von Glasersfeld, E. (1987). Learning as a constructive activity. In C. Janvier (Ed.), Problems 
of representation in the learning and teaching of mathematics. Hillsdale, NJ: Erlbaum. 


201 


bate! “DgPeliete ht ee waishatenen ‘ gi ereg 
4 a ee Ey tsa ae hee it 
cemeet (a uss ect erence ok oe 
ere 2b BIKE: ett ae 
ee ee Lars: tae 
ee rh 
gaan : angel “ee wa 
pie ~ ye ok ee i ” ities i ees ‘J 
| wi: “* , i Se Sa Pm 


. - oP: pate 
7 ® 
ay mitient 
° i mf ° 
é Teg 
i : , Pa ( 
hes 7 Het ‘ 
- Y ve a i ae 
A Ps) 
i a ye 
Ch af Le 
~ ee aay ae Ad 
© é : 4 : 
wy 7 ’ = a 7 > 5 J ,, " a ae eles . = 
4 : a Z 
a aa } 9 
: - 7 coc? ae aren - Ries: 
4 F i «dbs 
: . 


! 


itn omelet ' x 
7 a) a ¢ 1 = 4 awe aa an ie 
© = : spree « ve hs “ ae 


f ' ry LL 
weg Se i as Rat, ne 2 
ae , ; vs ri Conah res 
MR. oe". 5 15 alt oer. 7 Te Bier 3 a ; 


; ; set. : ne 

4 AD gestae Uli: Oat: dial : o ergy ees hie ina 
‘cued 22 ieesbegth 2a ieee eee 
argos : Sera wale, emit 
¢ 7 imo of eee - 

“et 24 aks " anowg Lnfiel Vendink- 

eer, tio oo ee eee 
” e ne wonomelohs 
rHAd) os) area eo ieee aes 
, ed 3 hike em ft 
Selah ae ig Sith Otek 
head. Geet eer Pages, G meen p= 


| , | ie T meee WF : 
vans 2 eae sai - 

is ee agi jae) oY er ; ‘ 
whedee Vii Scages ne  sanmateadling eit 


iT Aah Sea siete v 
3 al Hig fay? i 


night ithe elisa = 

posing ® me ar 
asia Ge 
Pind = 7 + 


“2 bo Se 


The Alberta Journal of Educational Research Vol. XXXVI, No. 3, September 1990, 203-222 


ARDRA L. COLE 
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Personal Theories of Teaching: 
Development in the Formative Years’ 


I introduce four first-year teachers who began teaching in different contexts and 
with different conceptions, expectations, and goals. Each has experienced and 
responded somewhat differently to the realities of her first year of teaching. I 
describe how they have been able to put into practice their personal theories of 
teaching, consider how their understandings have changed and developed over 
the first year, and discuss some of the conditions that seemed to enable or militate 
against the enactments of their beliefs. I conclude with some thoughts on the 
iumportance of collaboration between schools and faculties of education so that real 
change and improvement in teacher education might occur. 


Teaching is a complex and personal phenomenon. Each teacher’s practice is 
an expression of a personal and professional way of knowing that is shaped 
and informed by personal and professional background, experience, percep- 
tions, attitudes, beliefs, and goals. Research on the influences of life experi- 
ence on teacher thinking and teacher practice (Ball & Goodson, 1985; Butt, 
Raymond, & Yamagishi, 1988; Clandinin, 1986; Connelly & Clandinin, 1988; 
Feiman-Nemser & Buchmann, 1986; Knowles, 1988; Knowles & Charvoz, 
1989) and on teachers’ implicit theories, attitudes, and beliefs (Cole, 1988; 
Collay, 1989; Holt & Johnston, 1988; Hunt, 1987; Weinstein, 1989) provide 
considerable evidence to support the idea that teaching is much more than 
formal theory application or skill implementation. 


Personal Theories 


Students entering teacher preparation programs bring with them the beliefs, 
attitudes, ideals, and ambitions developed over years of life experience. As 
public school and university students, in some cases as parents, in others as 
community leaders or camp counselors, and in all cases as persons involved 
in human affairs, they have implicit thoughts and ideas about education and 
teaching. Having evolved over years of life experience, these ideas develop 
into relatively sound personal and pragmatic truths. These personal theories 
are informal and unvalidated (in the scientific sense) and in most cases 
remain unarticulated; yet they are the personal foundations on which an 
individual’s professional practice is built. 

The concept of personal theory has its epistemological roots in the writ- 
ings on personal knowledge by Ryle (1949) and Polanyi (1958) and in Kelly’s 
(1955) theory of personal construct. Similar to the more recent notions of 
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personal philosophy (Kroma, 1984) and personal practical knowledge, a term 

coined by Connelly and Diennes (1982) and developed by Connelly and 

Clandinin (1985), the concept of personal theory is most like Hunt’s (1987) 

notion of implicit theories and Handal and Lauvas’ (1987) practical theory. 
Handal and Lauvas (1987) define practical theory as: 


A person’s private, integrated but ever-changing system of knowledge, expe- 
rience and values which is relevant to teaching practice at any particular 
time.... “Theory” in this sense is a personal construct which is continuously 
established in the individual through a series of diverse events (such as 
practical experience, reading, listening, looking at other people’s practice) 
which are mixed together or integrated with the changing perspectives 
provided by the individual’s values and ideals. (p. 9) 


The point of departure for Handal and Lauvas’ notion of practical theory and 
mine of personal theory is in their use of the term in a collective sense. Based 
on teachers’ similarities in experiences, values, and beliefs they suggest that 
it is possible to develop a “collective practical theory.” In contrast, personal 
theory is a recognition and articulation of an individual’s characteristic 
uniqueness. Personal theory differs procedurally from the comparable con- 
cepts of personal philosophy and personal practical knowledge in that per- 
sonal theories are teachers’ not researchers’ interpretations. 


A Constructivist Approach to Teacher Education 


There would be little argument that teachers new to the profession begin 
their careers with enthusiasm, energy, and high ideals and aspirations; that 
the first year of teaching is a challenging and often frustrating and dis- 
couraging time for them; and that new teachers learn and discover a lot 
during that first year. The literature on teacher growth and development is 
rife with case studies, analyses, and descriptions of teachers’ experiences in 
the beginning years of teaching and their attempts to deal with the realities 
of classrooms and schools (e.g., Bullough, 1989; Lortie, 1975; Ryan et al., 
1980). 

Those of us who talk to first-year teachers or remember our own first year 
are familiar with charges of inadequacy against the preservice program in 
preparing teachers for the realities of classroom life. The impression is that 
the pedagogical theories and concepts studied at the university, although 
sound and important, are not necessarily relevant to what goes on in real 
classrooms. What is heard, read, and studied on campus contributes to the 
theoretical understanding of pedagogy, but in the real world, in classrooms, 
these theories do not play out. What goes on in school classrooms is actual; 
what goes on in campus classrooms is largely only possible. As we continue to 
struggle to improve the ways we prepare teachers for professional respon- 
sibility, we hope these same charges will, in the future, not be laid and that 
the connections between the theory studied at the university and the real 
world of practice will more easily be made. 

In preservice teacher education the gradual shift toward inquiry-based 
programs is illustrative of efforts to take into account the personal and 
complex nature of teaching. One approach receiving increasing support is 
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based on a constructivist view of teaching and learning (Clark, 1988; 
Feiman-Nemser & Buchmann, 1986; Goodman, 1988; Grimmett, 1988; 
Ross, 1988). This notion seems to hold particular promise for rethinking 
preservice teacher education. 

At Monash University, for example, each year for the past decade or so, 
prospective science teachers have graduated from a teacher preparation 
program based on constructivist views of education. One guiding principle of 
that program derives from the perspective of student teacher as construc- 
tivist learner with preconceptions of self, of science, and of teaching and 
learning. The teacher preparation program is designed to take into account 
and build on these preconceptions by providing experiences which enable the 
students to assess their existing knowledge and beliefs, reflect on how they 
relate to the new ideas presented, and rethink and discuss their ideas 
throughout the reconstruction process. 


The well developed views of teaching and learning [that student teachers 
bring to the program] are very persistent and often at odds with the views we 
hope to cultivate. Hence the views need to be identified, discussed and 
evaluated by student teachers in light of carefully managed teaching/learning 
experiences. (Gunstone & Northfield, 1987, p. 3) 


Taking into account prospective teachers’ preconceptions or personal 
theories of teaching and learning, and encouraging the notion of preservice 
teacher education as the first stage of career-long professional development 
and knowledge reconstruction (Gunstone & Northfield, 1987) seem to be 
realistic and appropriate principles to guide the education of prospective 
teachers. Presumably time spent in the preservice program making explicit 
and examining these preconceptions and developing skills in, habits of, and 
appropriate attitudes toward reflective practice will facilitate the develop- 
ment of self-aware and inquiring professionals who will then continue to 
examine and reflect on their teaching. 

What about those teachers who leave their teacher preparation programs 
having examined their preconceptions and developing conceptions of teach- 
ing and learning and who are intent on career-long inquiry? What is the first 
year like for them? Do they maintain their attitudes, perspectives, and 
ideals? Do they fare any better or worse for their self-awareness? 


A Study of Teacher Development 

The purpose of this study was to follow the development of the personal 
theories of teaching of four beginning teachers as they progressed through 
the first year of teaching and to identify factors and conditions that seem to 
enable or constrain the enactment of their beliefs. I present profiles of each 
teacher’s personal theory of teaching, describe each teacher’s first year prac- 
tical situation, and suggest ways that faculties of education and schools 
might better facilitate the professional growth of new teachers. 


Overview of the Study 


Four prospective teachers enrolled in a one-year after-degree preservice 
teacher education program joined me to begin an exploration of their implicit 
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theories, attitudes, and beliefs about teaching. When these four teachers left 
the university to begin their first teaching assignment, they took with them 
new and important knowledge and skills related to teaching, learning, and 
education. Like most beginning teachers, they had studied theories and 
practices of learning and development; considered issues in education from 
philosophical, psychological, sociological, and political perspectives; learned 
skills and techniques associated with effective teaching; and studied and 
experimented with various methods of and approaches to delivery of cur- 
ricula. They also took with them important knowledge of a different sort— 
not new, but articulated personal knowledge. Unlike most beginning 
teachers, they had also spent time studying their own preconceptions and 
developing conceptions of various aspects of education. 


The BPT Program 


Over the course of the academic year these four participants, then student 
teachers and at the time of this study first-year teachers, began a program of 
self-inquiry through a series of experiential reflective activities, group discus- 
sions, and individual in-depth interviews. By the end of the preservice pro- 
gram each had articulated, orally and in writing, a Beginning Personal 
Theory (BPT) of teaching with which they were about to enter the profession. 

A detailed description of the nature and content of the self-inquiry pro- 
gram and an analysis of student response to the BPT program is presented in 
an earlier paper (Cole, 1989). A brief description of the program and articula- 
tion process follows. The content of the BPT program consisted of 10 experi- 
ential activities designed to correspond with the main topics covered in an 
educational psychology course (e.g., styles of classroom management, effec- 
tive teaching, roles of the classroom teacher, learning styles, discipline). The 
procedure for the completion of each activity was based on the four-phased 
experiential learning cycle (adapted from Kolb, 1984 by Hunt, 1987). 

Each activity began with an imaging exercise in which the students were 
asked to recall and recreate a personal experience relating to a particular 
issue or topic. This experience provided the basis for reflection. The students 
then in some way recorded the event describing their perceptions of and 
responses to it. Either independently or through group discussion and analy- 
sis they made sense of the experience drawing conclusions about their under- 
standings. These conclusions were articulated in the form of summary 
statements that they were then able to test over a period of time to see how 
they fit with the formal theories they were studying, as well as with the 
teaching practices they were observing during their practica. 

We met as a group after each practicum period to discuss their developing 
conceptions. At the end of the program, after this period of examination, the 
students reported their results. They articulated their conceptions in a com- 
prehensive explication of a Beginning Personal Theory of Teaching. This 
final statement was a synthesis and extension of the insights gained from 


each BPT exercise. The transcripts of our recorded discussions provided 
additional insight. 


206 


Personal Theories of Teaching 


In addition to statements about their attitudes toward and beliefs about 
various educational issues, the four teachers made statements of intent about 
their teaching and their classrooms. They expressed visions and ideals about 
what their classrooms would be like and how they would teach. They stated 
expectations about the difficulties and demands of the job. They were pre- 
pared to be challenged in many ways and hoped that they would be able to 
successfully meet these challenges. Most of all, they expressed enthusiasm 
and eagerness to begin, satisfied with their career choice. 

The inquiry has continued through the teachers’ first full year of teaching. 
The teachers and I have continued to meet to study, through discussion, their 
personal theories of teaching. We have considered how they have been able to 
put their theories into practice, how their understandings have changed and 
developed over the first year, and we have discussed some of the conditions 
that seemed to enable or militate against the enactment of their beliefs. 

I would like now to introduce the four teachers and present brief profiles 
of the personal theories of teaching with which they entered the profession. 
Following each one through the first year, we see how each teacher experi- 
enced and responded to the realities of her first year of teaching, “doing [her] 
damnedest” to enact her beliefs, “no holds barred” (Bridgman, 1945, p. 445). 
This provides the basis for some reflections on how, as teacher educators and 
experienced practitioners, we can better understand and facilitate teachers’ 
development in the formative years. 


Wendy 
Wendy enrolled in the primary/junior component of the preservice teacher 
education program immediately on completion of an undergraduate degree. 
Because she wanted to teach children in the early primary grades, Wendy 
took an elective kindergarten course. Her first job was a half-time position 
teaching kindergarten. 


Wendy’s Beginning Personal Theory of Teaching 


The following represent views Wendy repeatedly and consistently expressed 
in her written statements and in our interviews and discussions throughout 
the preservice year; they seem to reflect the fundamental principles of her 
beginning personal theory of teaching. Wendy gives importance to: 

- providing the children with a warm, accepting, inviting, and stimulat- 
ing learning environment; 

- providing positive and successful learning experiences; 

- providing meaningful and relevant learning materials, “[materials] 
that are relative to their ability and to their age, things that are in good 
shape”; 

- providing opportunities for active involvement and discovery; 

- being sensitive and responsive to the children’s individual needs and 
differences; 

- providing guidance and support for the development of individual po- 
tential. “Each child has to develop at his or her own potential and rate, 
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and to the best of his or her own ability. That should be the goal for both 
[teacher and student]”; 

- fostering a desire to learn; and, in short, 

- “mak[ing] a difference in the lives of the children that pass through my 
classroom.” 


Theory in Practice 


Wendy had definite ideas about what she wanted her classroom to be like, the 
kind of first school experience her children should have. She had articulated 
her beliefs, goals, and ideals during her preservice program and was deter- 
mined that they be expressed in her practice. She was presented with a 
number of obstacles that she routinely set about to overcome, often at 
considerable personal cost in time and money. In order to be able to realize 
her objectives she had to be extremely resourceful and innovative. She work- 
ed at modifying her classroom and her program to suit her personal image of 
her practice. “I was hired just before school started,” she says, “and every- 
thing has been struggle, struggle, struggle just to get on top of it and stay one 
step ahead of the children.... But I’m enjoying it.” 

Like most first-year teachers, Wendy was faced with a less than ideal first 
assignment: lack of space, materials, resources, and a group of children she 
described as “all the leftovers: children who should be in another school but 
who wanted a half-time program so they came to my school, a mixture of 
different sorts of children, a lot of only children.” On the bright side, though, 
Wendy’s first assignment was only half time (with half-time salary, of 
course). She was able to use the rest of her day for planning, developing, and 
collecting materials and ideas, and for trying to modify her classroom and 
program so that her practice could be more consistent with her beliefs. 


Space and Ambience 


Wendy’s classroom was located in what she called an overflow room, an area 
in the school that because of its limited capacity and accoutrements was 
ordinarily used only when extra space was required. Situated in this less than 
adequate physical space, Wendy had to set about remodeling the area, to 
“basically construct a single kindergarten.” The severe shortage of space 
made the provision of opportunities for active involvement and self-discovery 
difficult, but with some effort and imagination she was able to approximate 
her goal. She did not have the space to set up a puppet center, for example, 
and she had to construct a barrier between the sand and water tables because 
they had to be placed side by side, “and they really shouldn’t be.” She had to 
limit the number of people allowed at the painting center at one time because 
she had only half an octagon-shape table for painting. “It would be really nice 
to have a whole one.... But there’s not a lot of room.” 

The writing center she had created became so popular with the children 
that she had to try to expand it as best she could. “At first there was just half 
a table, and room for three people. Well, so many people wanted to go there 
all the time, and I didn’t want to discourage people going to the writing 
center so | expanded it, made it bigger; recreated my room: added another 
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table, a typewriter, tracing letters, markers, anything that I could get hold 
of.” 

Wendy made a number of modifications in an attempt to make the 
classroom “as welcoming and as stimulating as possible.” Sleeping bags and 
pillows made the reading corner “a little more comfortable.” Having plants 
in the room, decorating the walls with children’s work and “some other basic 
things” such as the alphabet and numbers, and using bright colors were all 
attempts to “stimulate the room.” Not having a carpet in the main group 
area was remedied when the superintendent visited her classroom and re- 
commended one. “That will make it nicer. I think it’s pretty warm, except for 
the concrete.” 


Materials and Supplies 


Lack of supplies made it difficult to provide meaningful and relevant learn- 
ing materials. A shortage of painting easels in the school restricted Wendy to 
“flat drawing.” “Besides,” she says, “there isn’t enough room to set one up 
anyway.” 

It’s ideal when you think of the things you could do with the children but 

when you don’t have the supplies.... For example, I don’t have pattern blocks. 

I don’t have unifix cubes or any sort of connecting cubes, anything along that 

line.... I had to make a huge order and luckily [the Board office] okayed some 

of it. I was ordering things like puzzles, masks, and various other materials. I 

find [the lack of supplies] very limiting. So I try to make it up in other areas.... 

It could be a lot more stimulating for the children if the materials were there. 


It’s fabulous what we learn about teaching math or language, or whatever— 
great ideas—but if you don’t have the supplies to do it then you don’t have 
the supplies to do it. It’s very well and easy to say why not ask the parents or 
do this or that but it takes a lot of time to get that kind of thing organized. I 
find I’m basically just one step ahead of the children. 


Because the library in the school had relatively few books appropriate for 
very young children, Wendy made regular trips to the public library in her 
home town (about 100 km from the school). “Every second or third Saturday 
I go and change the books.... It gives the children a bigger selection, more 
variety. And they bring their books back when they go to the library. And that 
adds to the choice for all the other children too.” 

Wendy strongly believed in the Whole Language program used in her 
board and wanted to use “Big Books” and “predictable books” but found that 
she did not have easy access to those materials because the borrowing policy 
required that all requests for the materials be made at the end of the last 
school year before she was hired. But she was able to come up with a solution. 


I’m sharing. I’ve discussed it with the other kindergarten teacher. When she 
gets a Big Book in she’s going to try and use it at the beginning and then give 
me time at the end. At least they’ll get some of it. 


And it’s the same thing with the idea of using poetry. I think the idea is 
fantastic but where do you get those poems? How on earth can you put these 
ideas into practice when you can’t find the resources to do it with? I saw [the 
person who introduced Wendy to the idea] at the last professional develop- 
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ment day and he recommended a few books. I tried to get down some of the 
names. But, you know, I go out and buy a couple of books and it comes out of 
my pocket. At half a salary that really hurts. 


Program Emphasis 


Wendy maintained her view of the need for sensitivity and responsiveness. 
She described her program goals as being consistent with the needs of the 
children in her room. 


So for my kids, it’s a lot of just learning how to play together cooperatively... 
In another class my goals might be quite different.... You need to just plan to 
what you think your children are capable of accomplishing.... I know the 
abilities of my kids at this time and that’s what I’ve got to focus on. 


When Wendy first began to make explicit her ideas and ideals of teaching, 
she spoke often about her ambition to provide a warm and accepting environ- 
ment for the children, to be able to strike the balance between providing 
freedom and maintaining control. 


The teachers I really liked were the teachers who could relate to me, or who 
were warm and sensitive. I think more than anything that’s what I try to do, 
and yet keep my thumb on things. When I think back that’s what I always 
remember. And someone who was able to keep control over a class and make 
it interesting. That’s what I strive to do. ° 


Wendy’s summary comment on her teaching practice is reminiscent of 
her earlier statements in her Beginning Personal Theory of Teaching. 


One of my main goals is for all my children to enjoy being there, enjoy coming 
to school, [to think that] school is a great place to be.... The second is to 
stimulate them as much as possible in the way of language—oral and writ- 
ten—and reading ... and the different levels of reading.... All I ask is that they 
do their best.... It seems like I have one main role that is basically my goal, and 
that is to make it a nice place for children to be. In that sense I’m a provider: 
a provider of materials, or a provider of warmth, or of nice comments or 
discipline or whatever. It seems like everything else stems from that. Every- 
thing comes back to one main goal—to be there for the children.... [A good 
day] is just the little things [the children] say, like “It’s really nice to be here. 
I really like being here.” It makes you feel good to know that [school] is a 
positive thing for them. 


Despite what seems like a sink-or-swim situation, Wendy insists she is “in 
a good school,” that she has been well supported by the other kindergarten 
teacher, the principal, and the other teachers in the school. She has worked 
hard to make her practice an expression of her explicated beliefs. 


Monique 
Monique enrolled in the intermediate/senior section of the preservice teacher 
education program immediately following completion of an undergraduate 
degree. She wanted to teach high school French and/or English. Her first 
full-time teaching assignment was grades 9 and 12 English. 
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Monique’s Beginning Personal Theory of Teaching 


The themes recurring in Monique’s writing and talking about her teaching 
reflect the importance she places on the relevance of education and on 
personal and social responsibility. She expressed a need to 

- make learning a living experience; 

- prepare students to be worthwhile members of society; 

- guide students and give coherence to classroom activities; 


- help students become autonomous, self-aware, and self-appreciating 
learners; 


- help students learn to accept responsibility for their actions; 
- establish and maintain mutual trust and respect; and, above all, 
- remain a professional. 


Theory in Practice 


Monique’s commitment to her beliefs and goals was so strong that she 
decided on her first teaching assignment, based on the opportunities that 
would be provided her, to maintain what she called a “moral focus.” Al- 
though she was qualified to teach both French as a second language and 
English and had job opportunities in both areas, she was hesitant about 
choosing to teach French for fear the subject might carry with it certain 
constraints. “I thought it over very carefully. Finally I said, ‘I can’t get 
enough of that [moral/social] focus in teaching French.’” So she opted to 
teach English where she thought she would have a better chance of putting 
into practice her beliefs. 


I think it’s so important that students today learn to make decisions and live 
with them and be responsible for their actions and think about what they’re 
doing in the world. In a lot of courses you don’t get an opportunity to do that. 
That’s why I chose English over French.... I think [students] should know 
where they stand on issues. 


Monique was guided through her entire first year by her attitudes and 
beliefs about the importance of establishing mutual trust and respect and 
maintaining a professional image. At one point during the year she expressed 
frustration over the persistence of this struggle. 


I look young ... and they see me as their age and that’s very difficult. It’s very 
difficult to remain professional.... With some of the students I’ve managed to 
remain professional but with others I haven’t so the entire atmosphere of the 
class is not professional.... I don’t like it at all. ’m looking forward to my next 
class so that I can start right from the beginning to be professional using what 
I’ve learned.... It’s very uncomfortable for me to be in a classroom and not be 
seen as a professional. I don’t think my students are benefiting as much as 
they would if they saw me as a professional.... Classroom control stems from 
mutual respect. If you don’t have that then learning can’t happen because 
they don’t see anything you do as important. They don’t have enough con- 
fidence in you. 
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Reflecting aloud on her experiences of her first year she expressed par- 
ticular satisfaction with her success at having ultimately achieved her goal of 
professionalism. “Finally, I do feel professional; but my idea of professional 
has changed.... Now I’ve realized that age doesn’t have anything to do with 
it.’ The problems she had experienced with her grade 12 class and had 
attributed to the closeness in age of herself and the students, she discovered, 
were not age related in the sense that she initially thought. The problems 
were caused, rather, by her failure to “treat the students as mature adults.” 
They were rebelling against her treating them as younger students. By 
“starting off differently” with the new class in the second term she was able 
to discover how to establish the mutual trust and respect she sought from the 
beginning. Treating them as adults also enabled her to help them “learn to 
accept responsibility for their actions” and to “help them be better prepared 
for the work world.” 


My image of an unsmiling, straightforward, organized person in a business 
suit and carrying a briefcase is no longer my image of a professional. That’s 
not what it is to be a professional in teaching. I’ve finally come to grips with 
that. I have finally shown people that I am a professional. 


She did this by demonstrating her ability to establish and maintain a 
“wonderful” rapport with the students, showing her competence in her 
contribution to the school musical by “single-handedly costuming 66 kids and 
getting over 250 costumes free,” having students come to her for advice and 
help, and even having a parent seek her advice. This last incident confirmed 
Monique’s success. “I felt great. I thought, “She must regard me as a profes- 
sional even though I’m such a young pup.’” 

Monique’s overall impression of her first teaching experience was favor- 
able, although in the beginning she was terrified. She thought all the students 
were going to be major discipline problems and that she wouldn’t be able to 
control them, but it turned out to be much easier and better than she ever 
thought it would be. “It’s just so fun to go to work in the mornings,” she said. 


I think I’ve been successful with some students, making them all of a sudden 
realize how fun it can be to learn. That makes me very proud and happy. The 
students do like learning in my class. Some students whom I’ve never seen do 
anything at all before are starting to do the work. They’re not bored in my 
class.... Maybe it’s just this once but still they have understood what it’s like 
and how thrilling learning can be.... Maybe later on in life they’ll want to keep 
on learning.... Just what little bit they do enjoy, that’s really important. 


Monique has completed her first year satisfied with what she has been 
able to accomplish. She has been able to put into practice many of the ideals 
she had articulated in her preservice year and has reconstructed some of her 
understandings. Her perceptions of how she has begun to construct her 
practice suggest that she has not had much difficulty enacting her beliefs, 
attitudes, and goals. It is clear from talking with Monique that she is happy 
in her chosen profession and is likely to continue. 

What factors have contributed to the satisfaction Monique has experi- 
enced this first year? Monique herself was instrumental in engineering her 
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own success with her choice to teach English over French. But there were 

several other factors that worked in her favor: 

1. She had classes of reasonable size. The enrollment in each of her classes 
was less than 30; and the “general” level classes (which sometimes are 
comprised of particularly challenging students) each had less than 20 
students; 

2. Her department and the school she was in had a philosophy of education 
and approach to teaching that Monique understood and shared; 

3. She had two periods for planning each day which she assumed made a 
big difference; and 

4. She had a department head and other people in her subject area to 
whom she said she always went for advice and support. She said that 
she was encouraged at the first of the year to seek help from one of the 
experienced teachers in the school who would be her “mentor.” And she 
regularly did. 


It’s important to have an idea of what your goals are and what you believe in 
before you go into your first teaching job. That really helped me. I was lucky 
to get into a school that enabled me to carry out some of the things I wanted 
and to figure out some of the things I wasn’t sure about. I think it’s important 
to realize that every school is different. If I couldn’t do what I wanted in that 
school, I’d move on until I found a school where I could grow. 


Mary Ellen 

Mary Ellen had a background in science and music and was interested in 
teaching at the upper elementary level (grades 4-8). She too enrolled in the 
preservice teacher education program immediately on completion of an un- 
dergraduate degree. Mary Ellen did not have a regular full-time teaching 
position her first year. Instead of having her own classroom she was hired for 
two consecutive terms (September to December and January to June) to 
replace two teachers on maternity leave. Her first assignment was a split 
grade 5/6 class in a small rural school; the second was in a large urban school 
teaching language arts and visual arts to two grade 3 classes of students 
enrolled in a French immersion program and music to three grade 1 classes 
in the school. 

Mary Ellen had an exacting first year. But despite the concomitant dif- 
ficulties of supply teaching and of having to leave one class halfway through 
the year to take over another class dramatically different in almost all 
respects, Mary Ellen put in long hours of hard work toward the realization of 
the objectives she had set in her preservice year. She was prescribed a double 
dose of the isolation, personal trauma, and professional hardship often expe- 
rienced by first-year teachers. But despite frequent bouts of self-doubt and 
the considerable personal costs in time, energy, and social life, she was able 
to actualize many of the ideals she had articulated. 

To understand why Mary Ellen found her first year to be so demanding, 
one must first be apprised of the key elements in her personal theory of 
teaching. With this knowledge and an awareness of the nature of her teach- 
ing assignments will come an explanation of the impact on Mary Ellen of her 
first year of teaching. 
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Mary Ellen’s Beginning Personal Theory of Teaching 


What is most important for Mary Ellen in order for meaningful learning to 
occur is to acquire 


a good knowledge of the students: how they learn, where they are at emotion- 
ally, socially and cognitively, what is relevant to them.... Each child is dif- 
ferent and has different needs and sensitivities.... You have to be sensitive to 
them, to try and get to know them ... and see what kinds of things they like, 
what areas they feel strong in or don’t feel strong in, find out what things 
they excel in. 


Metaphorically, Mary Ellen describes teaching 


as gardening and the teacher as the gardener who tends to his or her plants 
in order to help them grow to their full potential ... helping them to develop 
healthy self-concepts and a sense of self-worth, and to help them learn 
meaningfully and have successful experiences.... Each child needs to grow, 
and grow strong.... You have to try to figure out what the needs of these 
plants are, how they can best grow and blossom.... When I think of a garden I 
think of the beauty of the plants and I feel that way about the students too. 
That’s why I’m there, to plant and nurture and take care of them. Probably 
the most important thing to me about teaching is helping the kids to grow. 


Theory in Practice 


Getting to know the students is a time consuming and gradual process. In her 
first assignment Mary Ellen worked hard to do so. She arranged time and 
opportunity to interact with the students, 


talking to them and writing back and forth in their journals.... I try to get a 
dialogue going. I do intramurals so I get to see them at lunch time and I do 
student council and choir so I get to see them at those extracurricular 
activities. And in class when I can. Sometimes when we’re conferencing in 
writing or reading I find out what they’re interested in and [follow up on 
that].... Just listening to their conversations is another way to get to know 
them. 


She was pleased when she was finally able to arrange the learning activities 
so that she could “stand back and observe them, watching how they work 
together and how they go about doing tasks.” 

All of this information and knowledge about the students helped Mary 
Ellen to design appropriate and meaningful learning experiences. As the first 
term progressed, so did Mary Ellen’s satisfaction with her teaching. The 
better she got to know the students, the better she was able to meet their 
individual needs and differences, and the more comfortable and self-confi- 
dent she became. By the time her first assignment ended just before Christ- 
mas, she felt that she was “really getting somewhere.” She had designed and 
developed some individualized programs based on the students’ particular 
interests and abilities and had finally been able to make gains in the estab- 
lishment of the kind of “child-centered program” she had envisioned in her 
preservice year by arranging for special outings and guest speakers, and 
having students actively involved in their own learning. 
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And then, in the middle of the year, Mary Ellen was thrown into two new 
homeroom classes, which she saw for only one and a half hours each day, and 
three music classes which she saw three times a week. Taking on 138 new 
students caused Mary Ellen to have a “not so satisfying second term.” “Not 
being able to get to know the students and consequently not being able to 
gear my program as accurately as I would like has been frustrating.” Reflect- 
ing on the second placement she talked more about what she was not able to 
do with and provide for the students than what she had been able to ac- 
complish. Although she was pleased with some of the learning unit activities 
she had developed, the individualized approach to language used in the 
school, and the potential of the program she was using with the students, she 
felt that she was somehow being negligent. “I feel guilty. I should be doing so 
much better for these kids. I’m not giving them what they should be getting.” 
Certain that “some are falling through the cracks” she felt helpless but ready 
to assume responsibility for not having enough time with the students, for 
not knowing enough about how the writing system worked, and not knowing 
enough about teaching reading and language to primary children (even 
though she was prepared as a junior-intermediate teacher). Establishing 
rapport and forming a personal attachment with the students is important to 
Mary Ellen. She was unhappy about the lack of time and opportunity to do 
this. Even though only having to teach two subjects to her two homeroom 
classes meant that she had considerably less preparation to do, Mary Ellen 
expressed her wish for more time with the students, more opportunity to get 
to know them better “even if it means a lot more planning,” and more 
opportunity to be spontaneous and responsive to their interests, “letting 
them direct where the learning will go.” 

Mary Ellen was pleased with the assistance and support she had been 
provided by colleagues in both schools and by the board. She had been 
assigned to an experienced teacher in the first school who spent considerable 
time with her, especially in the first week until the time commitment became 
too great and they had to stop their long after-school conversations. In the 
second school she and three other first-year teachers had formed a support 
group which met regularly to share ideas and concerns. She was particularly 
grateful for the monthly meetings of information sharing and support ar- 
ranged during the first term for all teachers, both new to the profession and 
new to the board. She also felt supported and encouraged by the consultants 
who offered “a shoulder on which to cry.” 

Although the assistance and support were welcome, the circumstances of 
her teaching situation precluded many opportunities for Mary Ellen to expe- 
rience success in her first year. The values and beliefs that needed expression 
in order for success to be achieved were restrained by circumstance. But 
despite the isolation, trauma, fatigue, and frustration Mary Ellen experi- 
enced in her first year, she remains committed to teaching and convinced 
that she had made the right career choice. 


I really like teaching. There are a lot of neat things about it that you wouldn’t 
have anywhere else.... I don’t think it’s easy.... I’m probably a lot less idealis- 
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tic.... There’s a lot of work but I think it’s worth it.... I have to think that all 
of this will make me a better teacher. 


Elizabeth 

Elizabeth, like Monique, wanted to teach French as a second language and 
English to intermediate/senior level students. Like the others, she enrolled in 
the preservice program directly following the completion of an undergradu- 
ate degree. Elizabeth was hired midway through the preservice program to 
teach French to kindergarten and grades 3 to 8 beginning the following 
September. Elizabeth’s teaching situation in her first year was perhaps the 
most incongruous with the goals, ideals, and expectations she made explicit 
in her preservice year. The frustrations, disappointments, and difficulties she 
experienced are painfully evidenced in the contrast presented by her teaching 
ideals articulated through metaphor and the reality of her first assignment. 


Elizabeth’s Beginning Personal Theory of Teaching 


I am reminded of the English romanticist Shelley who likened the mind in 
creation to a fading coal that is enlivened and returned to its “transitory 
brilliance” by an “inconstant wind.” As a teacher, I am the leader and source 
of inspiration in the classroom. I am the wind that propels the windmill. The 
students are the burning coals that brighten with inspiration as the wind hits 
them. Their intelligence is aroused when the inspiration moves them.... The 
wind generates the windmill and soon the windmill moves on its own with the 
help of the wind.... Without the driving force of the wind the windmill 
becomes stagnant and motionless. The wind gives the windmill a purpose to 
exist because without [it] it is helpless. The wind may change the windmill’s 
pace through its varying strengths and weaknesses. Its gusts set the atmos- 
phere for and endurance of the mill... As the wind generates the mill, it 
propels itself with energy.... | [want] to inspire my students to be creative, 
imaginative, and self-motivated and to try to find their hidden thoughts of 
“transitory brilliance.” 


Theory in Practice 


Elizabeth left the preservice program to begin her career as a French teacher. 
She was enthusiastic about her subject and about being a teacher. She was 
eager to get to know the children and help them discover themselves. She 
wanted to inspire them “to be creative, imaginative, and self-motivated and 
to try to find their hidden thoughts of ‘transitory brilliance.*” She wanted to 
instill in them an appreciation of knowledge and a love of learning. She had 
the kind of ideals and attitude toward teaching and learning that we want all 
our graduates to have. She was committed to the profession and a career of 
lifelong learning and professional growth and was prepared for hard work 
and long hours, expecting frustrations, disappointments, and, like all begin- 
ning teachers, perhaps some difficulties in the area of classroom manage- 
ment and discipline. 

Her first year did not turn out quite as she had anticipated. Placed in an 
especially challenging and demanding teaching situation for a first-year 
teacher, she was hardly able to do more than just survive. Teaching seven 
different classes in seven different classrooms meant that she was always in 
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someone else’s room. Because she did not have a classroom of her own she 
had difficulty commanding respect from the children, “establishing herself” 
with them and with herself for that matter. They did not see her as their real 
teacher and that made classroom management and discipline even more 
difficult than it might ordinarily have been. She says, 


At times, I still feel like a student teacher because I still have someone else’s 
classroom.... The students act up for the music teacher and the French 
teacher because we’re not their “real” teachers. They have no respect for us. 
Their homeroom teachers are their “real” teachers.... And if you only have 
them for 40 minutes each day, well ... I have tried to be down to earth and on 
the level with the kids but it’s different in this situation. They just act up. 
They look at me as being young and a new teacher and not their real teacher. 


In addition to the problems brought about by her always having to travel 
from class to class, she felt severely constrained by the subject she was 
teaching. Despite wanting so much to “reach the children on a personal 
level,” she felt that the requirements to, for example, speak French 80-90% 
of the time and to evaluate the children’s language competence made impos- 
sible the kind of personal interaction she wanted to take place. 


The only time I can [get to know the students] is on yard duty or whatever. I 
can talk to them and find out about them that way.... I just don’t have the 
opportunity [to get to know the children]. If 1 were to have a discussion [on an 
issue], for example, I could really only do it with the older grades. You can’t 
build the kind of rapport you need. 


Elizabeth was excited by the enthusiasm of the very young children but 
not hopeful that it would continue. “In kindergarten, they’re so keen. It’s all 
so new to them. As soon as they see me they say, ‘Bonjour.’ They’re so keen. 
If they can only keep it up to enjoy French until they’re older.” Early in the 
year Elizabeth spent considerable time trying to think of imaginative ways to 
help the children maintain or in some cases rekindle their interest and 
enthusiasm for learning French. She tried to make her classes as exciting as 
possible with special activities at Christmas, and games and activities that 
she was certain they would enjoy. But despite this she wasn’t really able to 
arrange the teaching-learning situations in the way she would have liked. 
She tried such methods as paired learning activities, but again, because of the 
demand for a teacher-directed approach, she was unable to do this as often as 
she wanted. As the year progressed and Elizabeth “became more realistic,” 
the evenings previously spent planning until 9:00 or 10:00 were devoted to 
recreational activities that helped her “get her mind off it.” “You know what 
you’re doing tomorrow,” she would tell herself, “so think of yourself for a 
change.” It upset her so much that the parents and students did not share 
her enthusiasm for her subject that she developed what she called a “more 
realistic attitude.” 

As much as you think you might really change things you can only really 

change them in a small way. If you’ve reached just one student, that’s great, 


at least from my perspective now. I thought I was going to make the ae 
love French. I’m leaving at the end of this year thinking, “That’s hogwash! 
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I thought I could make a difference and now I’m finally starting to realize I 
can’t. 

[The Faculty of Education] is marvellous. It’s really ideal. But when you’re 
really teaching it’s not the ideal situation.... [The Faculty] is like the ivory 
tower: Everything you read in the textbooks is marvellous, you have your 
friends there after practicum, you have their support and can exchange 
experiences, and it’s all very interesting. Once you’re out on your own you 
realize that. This is it and it’s real! ... Oh, you learn, you learn. 


Elizabeth was given the opportunity to move at the end of the year to 
another city and another board of education. Her experience in her first year 
helped her decide to make the change. The prospect of moving to a high 
school was exciting to Elizabeth because it meant that she could teach at the 
level for which she was prepared. It also meant that she would have her own 
homeroom class and that she would be perceived as a real teacher. Because 
she was applying to teach mainly English (her major) and perhaps some 
French (her minor) she would have better opportunity to put into practice 
her personal theory of teaching. 

Elizabeth had not given up completely. She held fast to her vision of 
teaching, her windmill metaphor, saying that she still believed strongly in the 
ideas she had articulated, but suggested that they would have to wait for 
practical expression in a more conducive teaching situation. “I still believe in 
it.... You have to generate the enthusiasm and you have to get things going 
and get the kids to participate.... But I think [my metaphor] must be more 
applicable to the English classroom.” Next year, in another city, teaching a 
different subject to classes of older students she will try again. I hope 
Elizabeth’s teaching situation in her second year will enable her to enact her 
beliefs and that she will find the renewed optimism and energy necessary to 
regenerate the windmill and enliven those fast-fading coals to their tran- 
sitory brilliance. 


Facilitating Teacher Development 

Teacher knowledge, like teaching practice, is personal, complex, and context 
dependent. How teachers understand and go about their work is a function of 
who they are, what they believe in, what conceptions they hold, what they 
aspire to, and the nature of the environment in which they work and interact. 
If we assume that development takes place through a process of reorganiza- 
tion and reconstruction of existing knowledge and concepts and that teachers 
redefine their knowledge as they act and interact in their professional con- 
texts, we can conclude that what takes place in preservice programs and in 
institutions in which new teachers are employed in the beginning years 
influences, in important ways, their development. 

Feiman-Nemser and Buchmann (1986) argue that “effective teacher pre- 
paration needs to pay attention to the prior beliefs of candidates and also 
prepare them to learn from their teaching in ways that go beyond the typical 
trial-and-error approach and reliance on personal preference.” They go on to 
suggest that teacher education can help prospective teachers understand 
their prior beliefs and dispositions and, if inadequate, can help to bring about 
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change (p. 2). Ross (1988) suggests that teacher educators help preservice 
teachers become more aware of their past experiences and preconceived 
beliefs about teaching “simply to expose individual beliefs to critical ex- 
amination ... [so that they will learn] to engage in systematic analysis of their 
own curricular goals and methods” (p. 5). By encouraging beginning teachers 
to understand their implicit theories and beliefs we help them, as Polanyi 
(1958) suggests, to bring to light and affirm the beliefs implied in their 
thoughts and practices, discover what they truly believe in, and formulate 
convictions they hold (p. 267). 

But if we hope, as Fullan and Connelly (1987) suggest, to reconceptualize 
and restructure teacher education and teacher development to more appro- 
priately reflect the continuing and developmental nature of becoming a 
teacher and to more fully acknowledge the complexity and individuality of 
teaching practice, we must begin to think about how best to facilitate this 
process along the continuum. This means going beyond the preservice 
program to the classrooms and schools that receive new teachers. We need to 
begin to focus our attention on preparing a school context that is conducive 
to growth not endurance; that welcomes new teachers and helps them con- 
tinue to engage in systematic reflection on and analysis of their teaching; and 
in which all the new teacher’s energy in the early years of teaching is not 
spent just trying to survive. 

The four teachers in this study began their careers in different settings 
and with different conceptions, expectations, and goals, and each experi- 
enced the first year of teaching somewhat differently. For Monique, the year 
was pretty much as she had envisioned. For the most part, she was able to put 
into practice her beliefs and ideals and to knowingly develop her under- 
standings. Her classroom and her teaching were, by and large, what she 
hoped they would be. Monique was welcomed into a supportive teaching 
environment where she was given an appropriate and realistic assignment 
and where she felt sufficiently comfortable and confident to grow profes- 
sionally. For instance, she was able to reconstruct her conception of one 
integral component of her personal theory—what it means to be a profes- 
sional—a significant point in her development. She was lucky, she said, to get 
into a school that enabled her to carry out some of the things she wanted and 
to figure out some of the things she was unsure about, and she found it 
helpful to have a clear idea of her goals and beliefs before she started. As a 
result, she knowingly evidenced professional growth. 

The others were not so fortunate. For a variety of reasons, Wendy, Mary 
Ellen, and Elizabeth experienced varying degrees of difficulty, frustration, 
disappointment, and success. They responded in different ways to the real- 
ities of their first year of teaching and their not being able to teach the way 
they thought they would or could. They all tried in the beginning to modify 
their situations to more accurately reflect their personal theories, attitudes, 
and beliefs. This proved more difficult for some than others. With time, 
effort, and resourcefulness Wendy was able to approximate her vision of her 
practice. Elizabeth finally became resigned to her situation and decided to 
put her ideals on hold, looking to the future for a better opportunity to enact 
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her beliefs. Mary Ellen too became “more realistic.” After almost an entire 
year of what seemed to her like incessant fatigue, she finally gave in to her 
situation, temporarily reconstructing her views to be more congruous with 
her situation. Resignment, disillusionment, and frustration—not growth— 
characterized their first year of teaching. 

All four teachers grew professionally in their first year, but the extent to 
which their potential for growth was realized was contingent on their practi- 
cal situations. It is interesting and somewhat ironic that the vision they had 
of themselves as teachers could be used just as aptly to describe a vision of an 
ideal growth environment for them as learners. Wendy needed someone to 
provide for her; Mary Ellen needed care and nurturing to help her grow; and 
Elizabeth, herself a fading coal, needed to be enlivened and returned to her 
state of transitory brilliance. 

It is well known that many new teachers lose the enthusiasm and energy 
with which they enter the teaching profession as they are coerced into 
exchanging their ideals for reals. Because the early years of teacher develop- 
ment are formative, schools and faculties of education must work together 
sharing a vision of and responsibility for teacher education; otherwise, pro- 
grammatic restructuring in either context will remain limited in its potential 
to effect real change and improvement in teacher education. Teachers leav- 
ing preservice programs that have been restructured to promote career-long 
inquiry and growth must be received by schools and school boards who are 
prepared to support and facilitate the continuation of such practice. We in 
faculties of education must be careful not to prepare new teachers for teach- 
ing roles that do not now exist, or do not have the sanction of the institutions 
in which teachers now work (Zeichner & Liston, 1987). Change that is not 
shared might at best be futile and at worst be harmful to new teachers and a 
disservice to the children in their classes. 

If teacher education is indeed to become a lifelong developmental process, 
preservice teacher education must become the official starting but not ending 
point of personal and professional inquiry. Inquiry into one’s own practice 
begins with an explication and examination of the foundations on which the 
practice will develop. Skills in, habits of, and appropriate attitudes toward 
reflective practice introduced in the preservice program must be fostered 
throughout the formative years. The student teachers who participated in 
this study were given an opportunity to begin their professional education 
with an emphasis on self-inquiry. From an analysis of how they experienced 
their first year, it is possible to identify several factors or conditions that, 
given appropriate consideration, would have provided them and presumably 
other new teachers with the kind of professional environment that would 
enable and encourage rather than constrain and discourage their develop- 
ment. School leaders preparing to receive new teachers should give con- 
sideration where possible to the following areas: 

- assignment of classes: class size; composition; curricular demands; 
consistency, as much as possible, with the level and subject area in 
which the teacher was prepared; and as far as possible, consistency and 
stability within and across class assignments; 
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- assignment of physical space; and for those new teachers who are not 
provided a classroom of their own, a special attempt to ensure that 
some “real” space is provided so that they can begin to develop a 
professional identity; 


- provision of adequate materials and resources; new teachers have a 
limited base supply on which to build; 


- provision of planning time; new teachers have limited experience in 
planning and for the most part are doing everything for the first time; 


- provision of opportunities for new teachers to gain an understanding of 
the school culture and to become socialized to the norms and routines of 
the school; 


- creating a climate of support with informal or formal designation of 
support teachers; 


- provision of opportunities for new teachers to form peer support groups 
and to meet regularly with other new and relatively new teachers; and 


- provision of opportunities for new teachers to spend time with students 
in non-teaching situations. 


Note 


1. Aversion of this paper was presented at the annual conference of the Canadian Society 
for the Study of Education, Quebec City, Canada, June 1989. 
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A Comparative Study of 
Student Teachers’ Feelings of 
Preparedness to Teach 


Increasingly, the importance of a learner’s feeling of self-efficacy with respect to a 
task is being recognized as an important prerequisite of success. This study 
focuses on student teachers’ feelings of preparedness to perform tasks central to the 
role of the teacher and measured on the PREP Scale (Housego, 1990). Student 
teachers’ feelings of preparedness to teach in the first term of a new teacher 
education program, September to December 1987, were compared with those of 
their counterparts at comparable points in the old program, September 1986 to 
April 1987. Feelings of preparedness to teach were monitored in secondary subject 
specialization, elementary program specialization, and gender-based as well as 
level of preparation (primary, intermediate, or secondary) subgroups. The New- 
man-Keuls method of pairwise comparison between means was used to determine 
which contrasts between subgroup means were significant. Both elementary and 
secondary student teachers’ feelings of preparedness to teach increased signifi- 
cantly during the first term of both the new and the old teacher education pro- 
grams. At the end of the first term, however, new program student teachers’ 
feelings of preparedness to teach were significantly greater than those of old 
program student teachers at a comparable point in their program. At this point, 
new program student teachers’ feelings of preparedness to teach were at a level not 
significantly different from that reached by old program students near the end of 
their teacher education program. For four of 10 new program subgroups—secon- 
dary art/music, secondary English/social studies/languages, elementary multicul- 
tural education, and elementary males—feelings of preparedness to teach did not 
increase significantly during first term. Contrasts were drawn with old program 
subgroup findings. Possible implications of the findings are discussed. 


One of the important prerequisites of successful teaching is preparation and 
confidence in one’s preparedness. This study compares the feelings of pre- 
paredness to teach of student teachers in a new teacher education program 
with those of their counterparts at comparable points in the old program. 
What does feeling prepared mean? Words like confidence and readiness 
come to mind. Bandura (1986, p. 391) described the “judgment of one’s 
capacity to accomplish a certain level of performance” as an important 
individual personal characteristic or disposition which he labeled “self-ef- 
ficacy,” a term also probably related to feeling prepared. In an earlier work 
(1981, pp. 205-209), he explained that self-efficacy may stem from enactive 
sources (based on performance achievements), vicarious sources (based on 
observing models), persuasory sources (based on verbal persuasion by oth- 
ers), and emotive sources (based on physiological arousal). In his view, enac- 
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tive sources are most important; however, subjects with little experience who 
therefore lack direct knowledge of their capacities may derive self-efficacy to 
a greater degree from vicarious sources. 

Self-efficacy, one’s judgment about one’s capacity to perform particular 
behaviors, is a personal characteristic or disposition which functions in a 
reciprocal deterministic relationship with both the behavior itself and the 
environment in which it occurs (Bandura, 1978). Explaining this statement 
and applying it to a student teacher’s experience, self-efficacy with respect to 
performing teaching behaviors is a personal disposition which is modified by 
and also modifies his or her teaching behaviors. Self-efficacy also affects and 
is affected by the educational environment in which the student teacher 
functions. Theoretically, greater feelings of self-efficacy with regard to teach- 
ing lead to improved teaching behaviors which in turn contribute to a richer 
teaching and learning environment. As well, proceeding in the opposite 
direction, a richer educational environment may stimulate a broader, more 
effective set of teaching behaviors to which pupils may respond with impro- 
ved motivation and achievement and thereby augment a student teacher’s 
self-efficacy regarding teaching, a personal cognitive disposition. Each com- 
ponent in the three-part model plays a role in shaping the other two com- 
ponents. According to Bandura (1978, p. 345), psychological functioning 
“involves a continuous reciprocal interaction between behavioral, cognitive 
and environmental influences.” Theoretically, therefore, self-estimates of 
one’s capacity to perform specific teaching tasks can be seen as an important 
cognitive influence on teaching behaviors and on the educational environ- 
ment. Bandura (1977, p. 191) hypothesized that self-efficacy “determines 
whether coping behavior will be initiated, how much effort will be expended, 
and how long it will persist in the face of aversive experiences.” This hypo- 
thesis has been interpreted, tested, and supported in a variety of educational 
contexts (Ashton & Webb, 1986). 

Student teachers might be expected to derive self-efficacy feelings regard- 
ing teaching from successful teacher education experiences—experiences 
which provide opportunities to tap all four types of self-efficacy sources: 
enactive, vicarious, persuasive, and emotive. Were this so, they would be 
unlikely to say, as many practicing and student teachers have, that their 
teacher education programs did not serve them well or that only the field 
component was really relevant (Carter & DiBella, 1982; Lanier & Little, 
1986). In this study a student teacher’s self-efficacy regarding teaching, his 
or her “judgment of [his or her] capacity to accomplish a certain level of 
performance,” (Bandura, 1986, p. 391), is described in terms of feelings of 
preparedness to teach, estimates of how well prepared the student teacher 
feels to perform a selection of tasks for which teachers are responsible. Being 
motivated is presumably an aspect of feeling prepared as well as a part of 
self-efficacy. The degree of emphasis may vary. 

Feeling prepared to teach might be assumed to be important to the 
execution of successful teaching, just as feeling prepared to sing or to run a 
marathon is important to the successful completion of those behaviors. From 
the perspective of Atkinson’s (1964) expectancy motivational theory, feeling 
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prepared to teach can be seen as a perceived probability of success. In 
Atkinson’s equation the perceived probability of or chance of success is 
multiplied by the incentive value of success to determine motivation. Feeling 
prepared to teach can therefore be seen to be an important aspect of the 
motivation of student teachers along with the incentive value attached to 
teaching. A student teacher’s acquisition of feelings of preparedness to teach 
can be explained in the same manner as the acquisition of feelings of self-ef- 
ficacy (Bandura, 1981). Opportunities to teach in laboratory settings and in 
classrooms provide an enactive source of feelings of preparedness to teach. 
When a student teacher observes others teaching and envisions himself or 
herself in that role, experiencing its rewards indirectly, the experience is a 
vicarious source of feelings of preparedness to teach. Persuasive sources of 
feelings of preparedness to teach come into play when the student teacher is 
literally persuaded by others that he or she can be a successful teacher. In a 
teacher education program, faculty advisors, sponsor teachers, and fellow 
students provide such encouragement. Finally, student teachers tap emotive 
sources of feelings of preparedness to teach. There is an excitement (often 
anticipation, sometimes fear) produced by an approaching period of school 
practice as well as happiness over success. 

In the context of a teacher education program it is possible to promote the 
development of student teachers’ feelings of preparedness to teach, capitaliz- 
ing on all four types of sources. For example, because enactive sources are the 
prime sources of feelings of preparedness, it is necessary to ensure that 
student teachers have successful laboratory and field experiences. Questions 
as to the appropriate type of teaching assignment, the duration of the assign- 
ment, the sensitive supervision of field experience (Housego & Boldt, 1985), 
and the compatibility of student teachers and their supervisors must be 
addressed. If a student teacher has opportunity to observe effective teaching, 
vicarious sources of feelings of preparedness to teach will be richer. The need 
for effective faculty teachers and sponsor teachers to serve as role models 
becomes apparent. Persuasive sources of feelings of preparedness must also 
be evaluated. More persuasion, unless it is warranted and in the form of 
genuine feedback, is not necessarily better. What about mixed messages? A 
coordinated approach to providing feedback may be sought, for example, in 
the form of three-way supervisory conferences (sponsor teacher, faculty 
advisor, and student teacher). Thinking of emotive sources of feelings of 
preparedness to teach, one recognizes the importance of a smoothly operat- 
ing, well-organized teacher education program in which expectations are 
clearly indicated and communication is generally good, thus avoiding the 
frustration experienced by students who don’t know what to expect, how and 
when to prepare, and how they will be evaluated. 

University of British Columbia teacher education programs have recently 
been revised in order to improve, standardize, reorder, and lengthen the 
educational experiences of student teachers and streamline the administra- 
tion of programs. As shown in Table 1, program modifications include a 
generic teaching skills course with large-group lecture and small-group lab- 
oratories, a developmental practicum sequence culminating in a 13-week 
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Table 1 
Brief Description of Curriculum for Two Programs 


Old Programs New Programs ™ 
Secondary and Elementary Secondary Elementary 
Courses (some extend beyond one term) Courses Courses 
Educational Psychology (development, Principles of Teaching same 
ee learning and measurement) (lecture and lab) 
0) 
A Educational Studies Elective (history Analysis of Education same 
' philosophy, sociology or social 
5 foundations). : Education during Application of 
os Adolescent Years developmental 
ce) Education Electives theories 
a Curriculum and Instruction Curriculum and Communication 
a elementary - all subjects Instruction - major skills(2 weeks) 
a secondary - concentrations and minor areas 
October (3 weeks practicum) 10 weekly half days 10 weekly half 
& 2 week practicum days 
coursework continued 
Courses Courses 
Communication skills Curriculum & 
(2 weeks) Instruction 
courses in all 
= February (4 week practicum) areas 
a, 
= 2 week orienta- 
! tion practicum 
¢ 
a Educational 
iy psychology 
“ig (early or 
F middle child- 
& hood) 
13 week practicum 
(Jan-Apr.) 
May (3 week practicum) 2. Courses 
B School organization 13 week pract- 
ae, icum (Sept.- 
ea Learning & Measure- Dec.) 
Te ale ment 
oo 
ia PS Educational Studies 
E Elective 
a 
eS Other electives 4th term (Jan.- 


Apr.) 

similar to 
secondary 3rd 
term 


Term 3 Sec. 


*Some changes have been made in these designs since 1988. 


extended practicum, time to permit a laboratory approach in curriculum and 
instruction courses, and a communications course. A program to prepare 


faculty and sponsor teachers as supervisors of student teachers has also been 
implemented. 
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Program revision was carefully planned by an across-departments faculty 
committee, debated and approved by the whole faculty, and subjected to the 
scrutiny of the wider university community, practicing teachers and ad- 
ministrators, and their professional organizations. All the program changes 
might be expected to enhance student teachers’ feelings of preparedness to 
teach. Increased feelings of preparedness to teach on the part of clientele in a 
teacher education program may be seen as an indicator, possibly a basic 
indicator, of the successful functioning and/or improvement of a teacher 
education program. In other words, because the intent of program revision 
was to improve the preparation of preservice teachers; to expose them to 
more controlled, standardized, carefully planned, and uniformly supervised 
coursework; and to involve them in more hands-on laboratory type prepara- 
tion for an extended period of time, one might expect them to feel better 
prepared to teach than their predecessors. 

To establish a baseline level of student teachers’ feelings of preparedness 
to teach, students in the last offering of the one-year baccalaureate program 
in 1986-1987 (the old program) were asked to indicate at points prior to each 
of three short practica (October, January, and March) the extent to which 
they felt prepared to perform a set of teaching behaviors (Housego, 1990). 
These behaviors were identified by an interdepartmental faculty committee 
as fundamental to teaching and worthy of inclusion in the first basic course 
in a professional teacher education program. The calendar description for 
this course entitled “Principles of Teaching” includes, “introduction to prin- 
ciples and instructional procedures related to classroom management in- 
structional planning, and the assessment of learning as applicable across 
grade levels and subject matter fields ” (Office of the Registrar, 1987, p. 283). 
The development of feelings of preparedness to teach was monitored in the 
total group and in various program and subject matter specialization sub- 
groups. In addition, the tasks for which students felt most and least prepared 
and the tasks for which feelings of preparedness increased most and least 
over the duration of the program were identified (Housego, 1990). 

Similarly, students in the first offering of the revised program (the new 
program) were asked to estimate their feelings of preparedness to perform 
the same tasks several times during their teacher education program. This 
study reports the development of new program student teachers’ feelings of 
preparedness to teach in the first term, September to December 1987, con- 
trasting, as in the previous study, the scores of program and subject special- 
ization subgroups with one another and making appropriate comparisons, 
when possible, with old program findings. Arising out of a basic question as 
to whether student teachers’ feelings of preparedness to teach were related 
to their teacher education programs, three specific research questions were 
asked: 

1. Did student teachers, on entering the new program, feel as well 
prepared to teach as their old program counterparts at or near the entry 
to their program? 

2. Did new program student teachers feel as well prepared to teach at the 
end of first term as their old program counterparts at a comparable 


22a 


B.E.J. Housego 


point in their program? Were there significant increases in their feelings 
of preparedness to teach during the term? 

3. Were there differences in feelings of preparedness to teach initially or at 
the end of the first term among new program student teacher subgroups 
identified by: 

a. grade level selection (primary, intermediate or secondary)? 

b. secondary subject specialization? 

c. elementary program specialization? 

d. gender? 

Did feelings of preparedness to teach increase significantly for these sub- 
groups during the term? Were there similar findings for old program 
subgroups? 


Method 
Subjects 


Elementary and secondary student teachers enrolled at the University of 
British Columbia in the new teacher education program participated in this 
study from September to December 1987. All the secondary subjects were 
graduates of other faculties, and most of the elementary subjects also held 
degrees, although they were admissible with only three years of prior arts or 
science coursework. 

The secondary student teachers’ specialization areas, on the basis of 
which feelings of subgroups’ preparedness to teach were compared, included 
mathematics, science, business education, home economics, English, social 
studies, language, music, art, and physical education. Elementary student 
teachers were not identified as subject matter specialists, but by program 
specialization. They all enroll in either the early childhood (primary) or the 
middle childhood (intermediate) program, and, in addition, they may or may 
not choose to pursue programs in special education, multicultural education, 
or French education. Feelings of preparedness to teach were compared across 
secondary subject specialization subgroups, elementary program specializa- 
tion subgroups, levels of preparation subgroups (primary, intermediate, and 
secondary), and elementary and secondary gender-based subgroups. 

Table 2 shows the total number of responses in each of two data collec- 
tions from secondary student teachers identified by subject matter specializa- 
tion and by gender. Table 3 shows the total number of responses in the two 
data collections from elementary student teachers identified in two ways: by 
program specialization or lack thereof (one group is a no-specialization sub- 
group), and by gender. 

In September 90.5% and in December 88.4% of the elementary student 
teachers participated. Secondary participation was lower, 84.6% in Septem- 
ber and 78.1% in December. 


The Scale 


The Student Teachers’ Feelings of Preparedness to Teach Scale (PREP 
Scale) was designed to measure the degree to which student teachers felt 
prepared to perform a set of tasks outlined by an 11-member interdepart- 
mental faculty committee as core in a teacher education program and ap- 
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Table 2 
New Program Secondary Student Participation by Subject Matter 
Specialization and Gender: September and December 1987 


Month 

Subject Specialization September December 
Math, Science, Bus. Ed., or H.Ec. 40 29 
English, Social Studies, Languages 21 20 
Art or Music 13 ‘fs 
Physical Education 9 9 
Total 83 73 
Gender 

Female 38 38 
Male 45 35 
Total 83 73 


plicable across grade levels and subject matter fields. It consists of 43 items, 
each stated so as to complete the sentence, “I feel prepared to ...” 

In repeated administrations the scale has been found to be highly reliable 
(between 0.95 and 0.97 based on Hoyt’s coefficient, an index of item homo- 
geneity) and valid for the purpose of this study (Housego, 1990). In an effort 
to ensure the content validity of the scale, items were developed to address 
the content of the Principles of Teaching course. Also, factor analysis was 
employed to determine how well the items represented the content areas of 
the course. In order to be sure the items discriminated among respondents 
perceiving varying degrees of feelings of preparedness to teach, their dis- 


Table 3 
New Program Elementary Student Participation by Program 
Specialization and Gender: September and December 1987 


Month 

Subject Specialization September December 
Early Childhood (Primary) 32 30 
Middle Childhood (Intermediate) 54 49 
Total 86 84 
No Specialization (beyond above) 43 43 
Special Education 13 13 
Multicultural Education 19 16 
French Education 11 (WV 
Total 86 84 
Gender 

Female 63 58 
Male 23 26 
Total 86 84 
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crimination indices (item-total score correlation coefficients) were scrutiniz- 
ed. Eighty-eight percent exceeded .50. 


Administration and Scoring 


Student teacher participation in this study was requested and data were 
collected during the first class of the Principles of Teaching course in Sep- 
tember 1987 and in one of the last class meetings in early December 1987. 

The 43 PREP Scale items were scored on a five-point scale anchored on 
one end by “very well prepared” and on the other end by “very poorly 
prepared.” Responses were assigned from 1 to 5 points, low scores indicating 
greater preparedness. The scores reported in Tables 4 to 8 are the averages 
over the 43 items for particular groups and subgroups of student teachers at 
various points in their teacher education programs. 


Report and Discussion of Findings 
The findings of this study are presented in six subsections: Whole Group: 
Program Entry Findings (September); Whole Group: First-Term Findings 
(December); Comparison of Primary, Intermediate and Secondary Sub- 
groups; Comparison of Secondary Specialization Subgroups; Comparison of 
Elementary Program Specialization Subgroups; and Comparison of Gender 
Subgroups. 


Whole Group: Program Entry Findings 


As a whole group, new program student teachers on entry to the program in 
September felt as well prepared to teach as old program student teachers did 
at acomparable point. Table 4 shows the PREP mean score at 3 points in the 
old program and the beginning and end of the first term of the new program. 
In answers to research question 1, there was no significant difference be- 
tween the mean PREP score of new program student teachers in September 
and that of old program student teachers in early October (F=.20, df=1/384, 
=) 

When the new program began, secondary student teachers felt adequately 
(midpoint on the 5-point scale) or more than adequately prepared to perform 


Table 4 
Mean Scores on PREP Scale in Two Programs 
Old Program New Program 
Month Mean* Month Mean* 
October 125.8 (21.4)+ September 124.8 (22.9) 
January TOA mee) December 104.3** (20.8) 
March 99.5 (25.2) 


+ Standard deviations are included in parentheses. 

*The differences between the mean scores within each program are significant at @ = .01 level, 
smaller scores indicating greater feelings of preparedness. 

**At the @ =.05 level, this score is significantly different from the January score but not 
significantly different from the March score. 
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the tasks presented in 24 of 43, and elementary student teachers for the 
tasks presented in 22 of 43 scale items. A similarly high level of confidence 
was expressed by old program student teachers who felt adequately or more 
than adequately prepared to perform 28 or 43 tasks by early October. Book, 
Byers, & Freeman (1983) also found that student teachers generally exhibit 
a somewhat surprising level of early confidence in their abilities to teach. One 
might expect inflated estimates of preparedness for activities with which 
people have limited familiarity (Bandura, 1986). That is, in both the old and 
the new programs, feelings of preparedness to teach may have been overes- 
timated because they drew more on vicarious than on enactive experiences, 
although some students do enter the faculty with community teaching or 
supervisory-type work experience with children or adolescents, and everyone 
has gone to school. 


Whole Group: First Term Findings 


By the end of December, new program student teachers felt significantly 
more prepared to teach than their counterparts in the old program felt in late 
January. Answering research question 2, the December mean PREP score 
for student teachers in the new program was significantly different from the 
January mean PREP score of old program student teachers (F=6.37, 
df=1/296, a=.05). What is more striking, the December mean PREP score of 
new program student teachers was not significantly different (F=3.35, 
df=1/296, a=.08) from that of old program student teachers in March when, 
except for a final three-week practicum, they had completed their teacher 
education programs. In other words, new program student teachers felt 
almost as well prepared to teach after three months as old program student 
teachers did after more than six months. 

One might speculate as to why such differences across programs oc- 
curred? The new program design attempts to ensure a more consistent, 
thorough coverage of basic teaching tasks in lecture and in laboratory prac- 
tice and discussion. One might interpret this as an attempt to create and 
utilize enactive as well as vicarious sources of feelings of preparedness to 
teach. Both programs featured some early school experience. In the old 
program, student teachers engaged in a three-week practicum with expecta- 
tions that they achieve 50% teaching time. In the new program, student 
teachers spent 10 half-days in schools either mentoring an individual or a 
small group in a learning enrichment mode, or tutoring small groups in the 
regular classroom on a weekly basis. It would be difficult to say that one or 
the other practical experience would be more likely to produce greater in- 
creases in student teachers’ feelings of preparedness to teach. The old pro- 
gram provided a slightly greater amount of early classroom experience, 
whereas working with individual learners or small groups may have enabled 
new program student teachers to be more reflective than old program stu- 
dent teachers were able to be during whole-class instruction. Further, all of 
the old program student teachers completed some methods courses before 
the January data collection point. This was also true of the secondary new 
program, although not of the elementary new program student teachers. To 
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the degree that methods courses enhance feelings of preparedness to teach, 
the elementary new program student teachers might be seen to have had less 
opportunity to develop such feelings. 


Comparison of Primary, Intermediate, and Secondary Subgroups 


Do primary student teachers feel more prepared to teach because they teach 
less difficult concepts? Do secondary student teachers feel more prepared to 
teach because they work largely in subject areas in which they hold degrees? 
Research question 3a focuses on differences in feelings of preparedness to 
teach across subgroups of student teachers by grade level of preparation. 

For ease of comparison of subgroups based in turn on level of preparation, 
subject specialization, program specialization, and gender, it was decided to 
carry out in each instance a one-factor, fixed-effects analysis of variance 
permitting comparisons of subgroup means both in months and across 
months. 

Using the Newman-Keuls method of pairwise comparison between 
means, it was determined that there were no significant differences at the 
a=.05 level in the September means or the December means (F=15.23, 
df=7/488, p<.001) of subgroups of student teachers based on grade levels of 
preparation, (elementary, including primary and intermediate, and secon- 
dary). There were significant and somewhat uniform increases in feelings of 
preparedness to teach for each of the four subgroups across the first term 
(from September to December). These might be attributed to the common 
experiences, redesigned standardized courses, laboratory and field type com- 
ponents, and systematic feedback provided in the program. 

Because no analysis of feelings of preparedness in the old program was 
made according to grade level of preparation, no program comparisons were 
made. 


Comparison of Secondary Subject Specialization Subgroups 


Is it a different task to teach mathematics or science than it is to teach in the 
area of the humanities or fine arts? Would a specialist in one of these areas be 
likely to feel more prepared to teach than a specialist from another? Secon- 


Table 5 
New Program Mean + Scores on PREP Scale by Level of Preparation 
Month 

Level of Preparation September December 
Primary* 128.8  (19.6)** 104.9 (21.0) 
Intermediate* 124.5 (27.9) 106.0 (20.3) 
Elementary (Total)* 1208) eeoe) 105.6 (20.5) 
Secondary* 123.4 (20.4) 102.9 (21.3) 


+ Smaller scores indicate greater feelings of preparedness. 

*A comparison of means showed that the differences between September and December 
means for these groups were significant at less than @ =.01 level. 

**Standard deviations are included in parentheses. 
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Table 6 
New Program Mean Scores + on PREP Scale by 
Secondary Subject Specialization Subgroups 


Month 
Subject Specialization September December 
1. Math, Science, Bus. Ed., or H.Ec.* 125:64 u(1.827)%4 105.995) (20:2) 
2. English, Social Studies, Languages 118.4 (23.1) 102.2 (22.9) 
3. Art, Music 119.4 (20.5) 102.3 (24.2) 
4. Physical Education* 131.4 (20.2) 95.8 (17.3) 


+ Smaller scores indicate greater feelings of preparedness. 

*A comparison of means showed that the differences between the September and December 
means for these groups were significant at less than the @=.01 level. 

**Standard deviations are shown in parentheses. 


dary subject matter specialization subgroups 1, 2, 3, and 4 included, respec- 
tively, mathematics, science, home economics, and business education stu- 
dents; English, social studies, and languages students; art and music 
students; and physical education students. The combination of some subject 
areas increased the size of subgroups for comparative purposes (home eco- 
nomics and business education students were few in number). Also, the 
combinations paralleled the groupings in the previous study (Housego, 1990) 
and represented, to a large degree, the organization of departments in the 
faculty. 

Using the Newman-Keuls method of pairwise comparison between 
means, it was determined that there were no significant pairwise differences 
at the a=.05 level in the September means or the December means (F'=6.11, 
df='7/148, p=<.001), indicating therefore, in response to research question 
3b, that no secondary subject specialization subgroup felt significantly more 
or less prepared to teach than any other, either at the outset of the program 
or at the end of the first term. Again, this might be at least partially at- 
tributed to the previously described consistencies of the new program. In the 
old program, secondary art and music student teachers felt more prepared to 
teach early in the program than their counterparts in mathematics, science, 
business education, and home economics, and at the end of the program they 
felt more prepared to teach than their counterparts in English and social 
studies. 

There were, however, significant differences between the September 
mean and the December mean for subgroups 1 and 4, indicating that those 
two subgroups, mathematics/science/business education/home economics, 
and physical education, respectively, increased significantly in feeling pre- 
pared to teach in first term. This was not so for subgroups 2 and 3, English/ 
social studies/languages and art/music. In September these two subgroups 
comprised 41% and in December 48% of the total secondary group. It is 
difficult to know why they did not come to feel increasingly well prepared to 
teach in that period and particularly why art and music student teachers, 
who in the old program felt more prepared to teach both initially and at the 
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end of their program than some of the other subgroups, should have been in 
that category. Some art and music student teachers and faculty expressed 
concerns about the difficulty of applying the generic skills course content on 
planning and instruction in their fields. Perhaps student teachers in these 
areas were exposed to some conflicting points of view in subject specific 
methods courses. 

By contrast to other subgroups of student teachers, the physical education 
subgroup made impressive gains in feeling prepared to teach. Its PREP Scale 
mean score increased by nearly a full point on a five-point scale as contrasted 
to about half that amount for other subgroups. The generally positive re- 
sponse of pupils to physical education may stimulate greater feelings of 
preparedness in student teachers, an example of the functioning of the 
previously explained reciprocal deterministic model (Bandura, 1977). Also, in 
their first degrees or in community work, physical education students often 
engage in teaching and, therefore, may be able to draw on enactive sources of 
feeling prepared to a greater degree than student teachers who have no 
previous teaching experience. 


Comparison of Elementary Program Specialization Subgroups 


Questions can be raised regarding the possible relationship between program 
specialization as well as subject matter specialization and feelings of pre- 
paredness to teach. Elementary student teachers had the option of enrolling 
in one of three programs: special education, multicultural education, or 
French education. About half of them chose to do so; the remaining students 
constituted a no-specialization subgroup. 

Again, on the basis of a one-factor, fixed-effects analysis of variance 
(F=6.04, df=7/162, p<.001), the null hypothesis of no significant differences 
among subgroup means was rejected at the a=.05 level of significance. Using 
the Newman-Keuls method of pairwise comparison among means, it was 
determined that there were no pairwise differences significant at the a=.05 
level in the September or the December mean scores shown in Table 7, thus 
indicating in response to research question 3c, that no new program elemen- 
tary program specialization subgroup felt any more prepared to teach than 
any other, either initially or at the end of first term. There were, however, 
significant differences between the September and the December means for 
three of four subgroups, the exception being the multicultural education 
subgroup, which did not experience a significant increase in feeling prepared 
to teach during the term. This result was similar to that found in the old 
program in which multicultural education student teachers, both midway 
through and near the conclusion of their program, felt significantly less 
prepared to teach than their counterparts in the no-specialization subgroup. 
In both the new and old programs, elementary multicultural education stu- 
dent teachers may have felt somewhat discouraged by the enormity of the 
task of meeting the needs of so many cultural groups in school settings in 
which class sizes were large and resources limited. One Vancouver high 
school, for example, enrolled students with 54 first languages other than 
English. The multicultural education program draws together students who 
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Table 7 
New Program Mean Scores + on PREP Scale by 
Elementary Program Specialization Subgroups 


Month 
Program Specialization September December 
1. Special Education* 135.69 {23,0) =" 106.8 (23.9) 
2. Multicultural Education 124.9 (23.5) 113.1 (24.6) 
3. French Education* 134.4 (27.7) 103.3 (13.9) 
4. No Specialization* 216m 25:1) 1031 (19:2) 


+ Smaller scores indicate greater feelings of preparedness. 

*A comparison of means showed that the differences between the September and December 
means for these groups were significant at less than the @ =.01 level. 

**Standard deviations are shown in parentheses. 


have an interest in working with children from various cultural backgrounds. 
Perhaps these student teachers estimated their feelings of preparedness to 
teach in the context of working primarily with such pupils and therefore 
envisaged greater difficulty. They were about to begin second term, subject- 
specific methods courses in which their special needs would be addressed in 
exclusively multicultural education sections. Perhaps feelings of prepared- 
ness to teach could be expected to increase to a greater degree in second term. 


Comparison of Gender Subgroups 


Female teachers have been found to express higher personal teaching ef- 
ficacy than male teachers (Cavers, 1988; Evans & Tribble, 1986). These 
findings prompted speculation about a possible relationship between gender 
and feelings of preparedness to teach, although the same question was not 
asked regarding old program student teachers. Research question 3d asked 
whether one gender subgroup of elementary or of secondary student teachers 
felt significantly more prepared to teach initially or at the end of first term 
and whether both male and female elementary and secondary student teach- 
ers increased significantly in feeling prepared to teach during the first term. 

Once more, on the basis of a one-factor, fixed-effects analysis of variance, 
(F=11.01, df=7/318, p<.001), the null hypothesis of no significant differen- 
ces among subgroups of means was rejected at the a=.05 level of signif- 
icance. Using the Newman-Keuls method of pairwise comparison among 
means, it was determined that there were no significant pairwise differences 
at the a=.05 level in the September means or in the December means 
displayed in Table 8, thus showing that no elementary or secondary gender 
subgroup of student teachers felt more or less prepared to teach than any 
other at the outset of the program or at the end of first term. This consistency 
did not hold with regard to increases in feeling prepared to teach during the 
term. Only gender subgroup 4, the elementary male student teachers, did not 
experience a significant increase in feeling prepared to teach in first term. 
Perhaps this finding can be linked to the impression that elementary teach- 
ing is a more feminine than masculine occupation (Apple, 1987; Rosenfeld & 
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Table 8 
New Program Mean Scores* on PREP Scale by Gender Subgroup 
Month 

Gender Subgroup September December 
1. Secondary Female* 124.2 (19.1)** 102.5.) (22:0) 
2. Secondary Male* N22 Baie 17) 103.3. (20.9) 
3. Elementary Female* 129.1 (24.3) 104.5 (21.0) 
4. Elementary Male 118.0 (25.9) 108.0 (19.4) 


+ Smaller scores indicate greater feelings of preparedness. 

*A comparison of means showed that the differences between the September and December 
means for these groups were significant at less than the @ =.01 level. 

**Standard deviations are shown in parentheses. 


Stephan, 1978). Depending on how male elementary student teachers view 
“women’s work” and how they estimate their capacity to do it, one might 
expect their feelings of preparedness to vary. Both initial overestimates of 
feelings of preparedness and real difficulty in performing the tasks could 
have contributed to the no gain result. 


Limitations 

Measuring student teachers’ feelings of preparedness to teach does not yield 
answers to questions regarding their actual teaching performances. Nor does 
it provide information concerning possible related improvements in pupil 
achievement; indeed, a student teacher’s feeling prepared to teach does not 
ensure that he or she is well prepared to teach. The foregoing are limitations 
of this study, some of which are being addressed as the project proceeds. For 
example, PREP Scale scores are being matched with practicum evaluation 
records of teaching strengths and weaknesses. In this study, few student 
teachers felt poorly prepared with respect to any scale item. It seems that the 
lower level of feelings of preparedness to teach may have limited application. 
It may be that student teachers with limited classroom experience lack the 
standards and knowledge regarding effective teaching which would enable 
them to understand the meaning of being poorly prepared or very poorly 
prepared to teach. 

There were at least two problems in establishing parallelism on the basis 
of which to compare the old with the new teacher education programs. First, 
the times of data collection were not entirely synchronized. In the previous 
program, student teachers were asked to estimate their feelings of prepared- 
ness just prior to each of three short practica (October, January, and March). 
In the present study, estimates of preparedness to teach were collected in 
early September, at the beginning of the program, and in early December at 
the end of the first term. Comparisons were made between new program 
September estimates and old program October estimates and between new 
program December findings and old program January findings. This was not 
a perfect fit; however, any advantage in the comparison process was in the 
direction of the old program. 
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A second problem related to the similarity of the participants in the two 
programs. Student teachers in the present study were graduates of other 
faculties, as were the subjects of the previous study. It is not known whether 
the offering of a new teacher education program, or for that matter, the final 
offering of an old and shorter teacher education program attracted students 
who were in some manner unique. 

Finally, there were limitations concerning the generalizability of the find- 
ings of this study to other subjects in other locations and at other times. 


Summary of Findings 

In summary, the three research questions were answered as follows: 

1. There were no significant differences in feelings of preparedness to 
teach between the total groups of new and old program student teachers 
at or near the entry to their respective programs. 

2. At the end of first term, new program student teachers felt significantly 
more prepared to teach than old program student teachers did, with an 
additional month of teacher education and a three-week practicum. 
During first term, like their old program counterparts, they experienced 
a significant increase in feeling prepared to teach. 

3. No subgroup of new program student teachers felt significantly more 
prepared to teach than any other related subgroup either initially or at 
the end of first term. Among old program subgroups, however, secon- 
dary art/music and also English/social studies student teachers felt sig- 
nificantly more prepared to teach than student teachers in some other 
subgroups at the time of the first data collection. Old program elemen- 
tary multicultural student teachers also felt less well prepared to teach 
than some other subgroups. 

Feelings of preparedness to teach did not increase during the first term 
for four subgroups of new program student teachers: secondary English/so- 
cial studies/ languages; secondary art/music; elementary multicultural edu- 
cation; and elementary males. Old program multicultural education student 
teachers also experienced no significant increase in feeling prepared to teach 
between the first and second data collections. 


Implications for Teacher Education 
One implication of this study related to the teacher education program 
changes which were implemented at UBC. It may be speculated that the 
significantly improved feelings of preparedness to teach evident in the total 
group of new program student teachers by the end of first term, and the fact 
that they felt almost as well prepared to teach after three months as the old 
program student teachers did after six months of teacher education, com- 
prise a modest affirmation of the teacher education program changes which 
have been made: the addition of a generic teaching skills course; increased 
laboratory-type, hands-on experiences; a developmental practicum beginning 
with half-days of working with individuals or small groups of learners; a 
communications course; and faculty and sponsor teacher training in the 
analysis of teaching and the supervision of student teaching. Although it may 
be suggested that this difference between feelings of preparedness to teach in 
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one program as contrasted with the other can be accounted for by a variety of 
other factors, the case for its being attributable to the previously described 
substantial program changes seems as credible if not more credible than 
most. For example, the claim that the finding is due to the newness of the 
program, a so-called halo effect, can be countered with the equally plausible 
argument that because the changes did result in some first-run administra- 
tive difficulties and there were mixed reactions to the program from the field 
and from faculty members, the production of a “horns effect” was equally 
likely. Students were regularly asked to critique their experiences and possib- 
ly thereby encouraged to see themselves as guinea pigs, expect problems, and 
become more anxious. At the very least, though, student teachers appeared to 
be assigning some face validity to their experiences, and greater confidence 
in their abilities to teach may enable them to accept and reflect on the 
feedback provided by sponsor teachers, faculty advisors, and pupils and make 
appropriate changes in their teaching behaviors. 

The importance of focusing on specific teaching skills in a teacher educa- 
tion program is a further implication underscored by Green and Miklos 
(1987, p. 31) who recently studied the responses to their teacher education 
programs of Alberta teachers with two or three years of experience, finding 
that fewer than 10% of the subjects identified the development of specific 
teaching skills as a program strength, while more than 25% identified lack of 
emphasis on the development of specific teaching skills as a program weak- 
ness. Perhaps the subgroups of UBC student teachers who did not come to 
feel increasingly well prepared to teach in a term might have done so had 
more attention been directed to the acquisition of specific skills, especially in 
the laboratory accompanying the generic skills course. 

A further implication relates to student course load. Despite the fact that 
secondary student teachers were enrolled in specific subject methods courses 
in first term and elementary student teachers were not, more secondary 
students did not increase in feeling prepared to teach. Their full calendar 
year program may have been more concentrated and anxiety producing than 
the elementary program which extends over two university years. 

Another implication of this study relates to the situation of male elemen- 
tary student teachers. Their lack of progress in coming to feel better pre- 
pared to teach during the first term raises questions about their responses to 
teacher education. It is widely recognized that many children do not have as 
much access to male role models as might be desired in their socialization. 
Certainly few of their primary teachers are male. Strategies need to be 
developed for making male student teachers who express an interest in 
primary teaching feel that it is indeed an appropriate vocational choice for 
men. 

In conclusion, the findings of this study may be interpreted as underscor- 
ing the need for teacher educators to establish a sense of belonging to, 
sharing the goals of, and supporting their particular programs and com- 
municating these feelings to student teachers. Similar factors have been 
shown to influence the achievement of students in both elementary and 
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secondary schools (Mortimore & Sammons, 1987; Rutter, Maughan, Mor- 
timore, & Ouston, 1979). 
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A Comparison of Average and Poor 
Readers in Their Recall Performance, 
Strategy Use, and Metamemory 


Two experiments were performed to examine the extent to which spontaneous 
cognitive strategic behavior and metacognitive knowledge may be implicated in 
reading. The subjects were 25 average readers and 23 poor readers at the grade 4 
level. Experiment 1 employed a multiple classification design to compare the recall 
performance of the two groups under intentional and unintentional memory 
conditions. The only significant effect was the memory condition main effect, 
confirming the superiority of conscious strategic processing in memory function- 
ing but revealing no between-group differences in inferred management behavior. 
In experiment 2, strategic behavior and general metamemorial knowledge were 
assessed directly in addition to recall performance. While the average achievers 
were relatively superior in strategy use and metamemory, both groups of children 
manifested a production deficiency and a generally low level of strategic sophis- 
tication. Across groups, significant intercorrelations were found among strategy 
use, metacognitive knowledge, and recall performance, although among the poor 
readers the metacognition-recall performance link was weak. 


Understanding the basic components of the learning process is a necessary 
prerequisite to understanding individual differences in learning. Memory 
and strategic behavior are two major components of the learning process. 
The ability to remember information is essential for learning to take place; 
consequently, the role of strategic behavior in enhancing memory and learn- 
ing in general has received increased attention among psychological re- 
searchers over the last two decades. For example, Torgesen and his 
associates (e.g., Torgesen, 1977a; Torgesen & Goldman, 1977; Torgesen & 
Licht, 1983) have suggested that some children are less successful achievers 
than their basic intellectual ability would suggest because they are either 
unable to develop and use efficient task strategies, or they lack the inclina- 
tion to do so. As theorizing and research in the field of cognitive information 
processing have advanced, there have been suggestions that the use of effi- 
cient strategies may itself be related to one’s knowledge and awareness about 
the nature of the cognitive activity and of the person and task variables 
which influence such activity (e.g.. Borkowski, 1985; Brown, 1978; Flavell, 
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1978; Flavell & Wellman, 1977; Gibson & Levin, 1975; Snowman, 1986; 
Waters, 1982). This development has stimulated interest in assessing the 
separability of cognition and metacognition (e.g., Slife, Weiss, & Bell, 1985) 
and in investigating the tripartite relationships among cognitive perfor- 
mance, strategic behavior, and metacognitive knowledge or awareness (e.g., 
Byrd & Gholson, 1985; Cavanaugh & Borkowski, 1980; Waters, 1982). 

Much of the research comparing good and poor readers on memory tasks 
is based on the conceptual framework that the differences observed between 
these two groups may be the result of differences in general cognitive strat- 
egies underlying not only performance on experimental memory tasks but 
also the attainment of reading and other academic skills (see Torgesen & 
Kail, 1980). Thus the central concern of experimental work on memory in 
populations of children who differ in reading ability actually transcends 
interest in the relationship between memory capabilities and reading profi- 
ciency per se. One ultimate objective of this line of research has been to 
understand the potential connections between deficient or inefficient cogni- 
tive strategic behavior and academic underachievement in general. 

Some of the most frequently cited works in this line of research are briefly 
reviewed here. For example, Kastner and Richards (1974) found in a study of 
grade 3 students’ memory strategies that while equal numbers of good and 
poor readers used the rehearsal strategy when presented with pictures of 
familiar stimuli, only the good readers continued to use rehearsal under a 
novel stimuli condition. In a study involving grade 2 students, Torgesen and 
Goldman (1977) found that even with familiar stimuli poor readers were 
deficient in the use of active strategies like rehearsal. Similar findings re- 
garding poor readers’ deficiency in the use of rehearsal strategies have been 
reported by Bauer (1977) and Tarver, Hallahan, Kauffman, and Ball (1976). 
Differences between good and poor readers in the use of more complex 
strategies have also been reported. In two separate studies, Torgesen and his 
associates found poor readers to be more deficient in the use of categorical 
organization to aid recall (Torgesen, 1977b; Torgesen, Murphy, & Ivey, 
1979). Categorical organization involves grouping the list of items to be 
learned into conceptual categories. 

In the presence of evidence that the strategic behavior differences be- 
tween good and poor readers are significantly diminished when the latter are 
explicitly trained or instructed to use strategies (e.g., Torgesen, 1977b; Tor- 
gesen et al., 1979; Worden, 1983), the foregoing findings have been inter- 
preted as supporting the proposition that the poor performance of poor 
readers on memory tasks is primarily a management problem. By inference, 
the failure to manage intact intellectual resources actively and with planning 
appears to be responsible, at least in part, for academic underachievement in 
populations of children with average intellectual ability. While current the- 
orizing views metamemorial knowledge (i.e., statable knowledge about one’s 
own memory, memory tasks, and about memory strategies and the variety of 
personal qualities necessary to apply them) as having a significant influence 
on such active and planned behavior, the extent to which metamemory 
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accounts for differences in the performance of underachieving and average 

achieving children has been less extensively researched. 

The present study was designed to test the management hypothesis, 
assess the relative metacognitive knowledge level of average and poor read- 
ers, and ascertain the extent to which recall performance, strategy use, and 
metacognition are interrelated. Two separate experiments were performed 
to address these goals. Experiment 1, which was designed as an indirect test 
of the management hypothesis, employed the “central versus incidental” 
memory paradigm to examine the extent to which explicit instruction to 
memorize for later recall made a difference in the recall performance of 
average and poor readers. In this experiment, strategic behavior was not 
measured directly; rather, the design allowed for inferring the role of 
strategic behavior in recall and between-group differences in strategy utiliza- 
tion from the pattern of main and interaction effects obtained for recall 
performance in a multiple classification analysis of variance. It was hy- 
pothesized that if the management thesis was valid, the following two results 
would be obtained: 

1. Across groups, students instructed explicitly to memorize for later recall 
(i.e., intentional memory condition) would perform better than those not 
instructed to do so (incidental memory condition); and 

2. The average readers would show superior performance on the intention- 
al memory task than on the incidental task. 

The rationale for this second hypothesis was that spontaneous development 
and deployment of appropriate strategies (i.e.. management) would be maxi- 
mized under the intentional memory condition, and therefore the better 
managers should show significantly superior performance there than they 
would under the incidental condition. In connection with these hypotheses, 
we expected to find a significant main effect for memory type (intentional 
versus incidental) and a significant memory type x group (average versus 
poor readers) interaction. 

Experiment 2, which was designed as a more direct test of the manage- 
ment hypothesis, examined between-group differences in recall performance, 
strategic behavior, and metacognitive knowledge and attempted to verify the 
interrelationships that have been theorized to exist among these variables. 
All three variables were measured directly, using appropriate instruments or 
tasks. With a few exceptions (e.g., Byrd & Gholson, 1985; Cavanaugh & 
Borkowski, 1980), comparative memory studies have traditionally tended to 
examine recall performance, strategic behavior, and metacognition inde- 
pendently of one another—making it impossible to assess these relationships. 
Two hypotheses were entertained. First, both strategy utilization and meta- 
memorial knowledge would constitute sources of variation between average 
and poor readers. Second, both within and across groups, recall performance 
would be related to strategy utilization and metamemory, and the latter two 
variables would correlate strongly with each other. 

Experiment 2 also addressed one important, albeit minor, methodological 
consideration. Studies examining strategic behavior have generally tended to 
assume that subjects already possess the basic concepts and capabilities 
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necessary for utilizing the relevant strategies. For example, children’s use of 
the categorical organization strategy has been examined frequently without 
testing for evidence of the presence or absence of the ability to see categorical 
relationships in the first place. In the absence of such a test, the failure to 
adopt the categorization strategy cannot always be attributed to a production 
deficiency (i.e., to deficient strategic behavior per se). By ascertaining sub- 
jects’ relative ability to see categorical relationships or sort items into concep- 
tual categories, experiment 2 made it possible to make more conclusive 
comments about the management hypothesis. 


Method 
Subjects 


Forty-eight grade 4 students selected from 10 schools within the Roman 
Catholic School Board for St. John’s, Newfoundland, participated in the 
study. Subjects were chosen from a pool of students scoring between 85 and 
115 on the Canadian Cognitive Abilities Test (CCAT), administered at the 
beginning of the school year by the Board. Reading performance on the 
Canadian Test of Basic Skills, also administered by the School Board at the 
beginning of the academic year, was the basis for assigning subjects to the 
poor reader (n=23) and average reader (n =25) groups. Because group mem- 
bership was determined on the basis of reading proficiency, IQ matching of 
the two groups was based on scores on the nonverbal subtest of the CCAT. As 
Table 1 shows, the mean grade equivalent reading achievement scores for the 
poor and average readers were, respectively, 2.5 and 4.3. Although the groups 
were comparable on nonverbal IQ, they differed significantly on verbal 1Q. 
The selection criteria resulted in the inclusion of more boys (n=31: 15 poor 
readers and 16 average readers) in the study than girls (1=17: 8 poor readers 
and 9 average readers), but both sexes were roughly equally represented in 
the two groups. 


Instruments and Procedures 


Both experiments employed picture rather than word stimuli. The choice of 
picture tasks was made in an attempt to control for the potential confound- 
ing effects that written stimuli could have on the performance of the poor 


Table 1 
Academic' and Cognitive Characteristics of the Sample 


Average Readers Poor Readers 

Mean SD Mean SD F-Value 
Reading 4.3 0.41 a6 0.19 Sonea ces 
Math 4.4 0.33 3.4 0.68 38.88 S45 
Verbal IQ 102.3 7.48 97.4 6.09 6.09 * 
Nonverbal IQ 98.3 9.19 96.5 7.95 0.49 NS 


*D <.01 idle) <.001 
'Grade level scores for reading and math 
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readers. Especially in experiment 2, where one primary focus was the direct 
assessment of strategic behavior, it was crucial to ensure that this assess- 
ment was not confounded by difficulties that poor readers might have decod- 
ing word stimuli. 


Experiment 1 


Line drawings of 15 common objects selected from the grade 2 level of the 
Dolch word list (Dolch, 1955) constituted the stimulus for this experiment. 
Each picture belonged to one of three distinct conceptual categories (means 
of transport: truck, car, train, bus, boat; animals: rabbit, cat, dog, cow, horse; 
and things you eat: candy, pie, bread, ham, apple). Pictures were drawn to 
equal size and were all placed on one page. Four of the objects had missing 
parts (e.g., the car did not have a door), a feature which made it possible to 
present the stimulus to the unintentional group as a search task. The pic- 
tures were located on the page in such a way that they were not in well 
defined columns or rows, and no two objects from the same conceptual 
category were adjacent to each other. 

The task in experiment 1 was the first to be given to subjects in the study 
and was given on a separate day in a group setting. Each student was given 
a sheet of paper with the 15 pictured objects on it. The intentional groups 
were explicitly instructed to study the pictures for later recall, while the 
unintentional groups were instructed only to study the pictures carefully in 
order to identify those with missing parts. After the study period (a maxi- 
mum of 2 minutes), the picture sheets were collected and each student was 
given a paper and pencil and asked to write down the names (labels) of as 
many of the pictures as they could remember without paying any attention 
whatsoever to spelling. The number of items recalled was the dependent 
measure for this experiment. 


Experiment 2 
A picture recall task and a set of metamemory tasks were used in this 
experiment. 

Recall task. The stimulus for the recall task consisted of a second set of 15 
line drawings of common objects representing grade 2 level nouns from the 
Dolch word list (Dolch, 1955). The objects were totally different from those 
used for the group-administered task in experiment 1. Each of the 15 line 
drawings was reproduced on a separate card, with the back of each card 
numbered to enable the examiner to lay the cards out in the same sequence 
for each subject. Again, each of the 15 items belonged to one of three distinct 
conceptual categories. This time the categories were things you wear (glove, 
hat, shoes, crown, and socks); furniture (table, bed, chair, lamp, and desk); 
and body parts (hand, nose, foot, eye, and ear). The task was designed to 
provide the efficient memorizer the opportunity to categorize the items 
during input. 

The recall task was administered individually by the second author with 
the subject sitting at a small table or desk directly facing the examiner. All 15 
cards were laid out face up in front of each subject in the same order with no 
two items from the same category placed in adjacent positions. Subjects were 
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instructed to try to memorize for later recall as many of the items as they 
could, doing anything they wished with the cards to help them remember the 
items. Subjects were asked to notify the examiner as soon as they had 
finished studying the pictures. A maximum time limit of 5 minutes was 
allowed, and each subject was inconspicuously timed, from when all the cards 
had been laid out to when notification regarding completion of the task had 
been received. 

During the study period the examiner observed and made descriptive 
notes on any observable behaviors and/or strategies such as lip movement, 
counting, or scanning. Card manipulation was particularly scrutinized, espe- 
cially with respect to shuffling or organization into meaningful groupings. 
On signalling completion, subjects were given a blank sheet of paper and a 
pencil to write down the names of as many items as they could remember. 
The examiner stressed that spelling was not important, and the examiner 
could be asked for assistance with spelling if subjects found it necessary to be 
able to carry on with the task. 

Following recall, the examiner asked each subject the following three 
questions designed to obtain a self-report of how the subject went about 
memorizing the items: 

1. How did you learn the list? What did you do to be sure that you would 
remember the pictures? What else did you do? Anything else? 

2. Why did you learn the list in that way? 

3. Are there any other ways you could have studied the pictures? 

Following this interview the examiner went through an activity designed 
to ascertain whether subjects’ failure to use categorical organization during 
memorization was due to deficits in conceptual classification ability or to 
difficulties in spontaneously taking strategic advantage of existing skills. The 
pictures were laid out on the table again in the original order. If a subject 
used grouping spontaneously and conspicuously during the study and recall 
period, he or she was simply asked how many categories there were and how 
many items were in each category. Answering these questions correctly 
earned the subject the maximum categorization score of 8 points. Students 
who did not use categorization conspicuously were asked to arrange the 
pictures into three conceptual categories by putting together all pictures 
which were similar (i.e., had something in common) and to label each cat- 
egory. A score lower than 8 was assigned to each student depending on the 
quality of performance. 

A strategy-use scoring key was developed after collating information from 
each subject’s verbal protocols on the three questions above and from the 
examiner’s observational notes. Protocols and notes relating to question 1 
(How did you learn the pictures?) revealed that in all 13 different strategies 
were employed by subjects. After compiling a list of the strategies and their 
descriptions, the two authors and five informed graduate students ranked 
the strategies in order of sophistication. The 18 strategies were grouped into 
five classes with categorical organization (10 points) and random or ordered 
scanning without verbalization/rehearsal (2 points) as the highest and lowest 
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classes respectively. Point values of 8, 6, and 4 were assigned to the middle 
three classes. 

A maximum of 3 points were awarded for question 2 (Why did you learn 
the list in that way?) whereas 1 point was awarded for any one or more 
strategies provided in response to question 3. Finally, a 2-point bonus was 
awarded for spontaneous monitoring or checking behavior such as counting 
the number of items before attempting to commit them to memory and 
verifying the completeness of recall by matching the number recalled with 
the first number. This manner of scoring yielded a maximum strategy index 
score of 16. 

Metamemory task. The metamemory tasks for this study were selected 
from Kreutzer, Leonard, and Flavell’s (1975) interview questionnaire. In 
each of the four tasks (story list, preparation object, retrieval event, and rote 
paraphrase), an orally presented hypothetical situation was used to elicit 
information on various aspects of children’s awareness about their own 
memory. Borkowski and his associates have found these four tasks to be the 
most valid and sensitive indicators of metamemory within the Kreutzer et al. 
interview for children between 6 and 9 years of age (Borkowski & Kurtz, 
1984; Cavanaugh & Borkowski, 1980). 

In order to be able to perform higher-level quantitative analyses on the 
data, we departed from the scoring system originally used by Kreutzer and 
associates. Instead of attempting to describe children’s responses to each of 
the situations separately, we coded responses and assigned points to each 
response class according to the level of sophistication. A composite meta- 
memory score was then derived for the entire battery (maximum score of 30 
points, including 2 points awarded for responses to one of the probe ques- 
tions! asked in connection with the categorization task described above). 
Kurtz, Reid, Borkowski, and Cavanaugh (1982) have shown that pooling the 
various subtests of the Kreutzer et al. (1975) interview to yield a composite 
score enhances reliability (see also Schneider, 1985). Each of the four tasks is 
described briefly below. 

Story list for assessing children’s knowledge of the efficacy of elaboration: 
Subjects were presented the situation that two children had to try to remem- 
ber eight pictures (man, bed, tie, shoes, table, dog, hat, and car). One child 
only saw the pictures, while the other saw the pictures and heard a story 
involving all eight items. Subjects were to indicate which condition made it 
easier to remember, which child would learn the most, and why. One point 
each was awarded for stating that the story made it easier and that the child 
hearing the story would remember more. Additional points (maximum=4) 
were awarded for ability to verbalize an explanation for why elaboration 
would enhance retrieval. 

Preparation object for measuring planning behavior in preparing for fu- 
ture retrieval: Instructed to assume that they were going skating the next 
morning, subjects were asked what they would do to make sure that they did 
not forget their skates. Possible answers were grouped into four categories 
with a one-point credit for an answer from each category (maximum of 4 
points). The categories were relying on external cues having to do directly 
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with manipulation of the skates themselves; relying on external cues pro- 
vided by others; relying on external cues involving the use of another object 
(e.g., a note); and relying on internal mental cues. 

Retrieval event for assessing systematic memory search for a past event: 
Subjects were asked to assume that they had lost a jacket and to indicate how 
they would go about finding it. A maximum of 7 points were awarded for a 
variety of potentially useful strategies. 

Rote paraphrase for assessing children’s knowledge of the relative ease of 
gist recall over rote recall: Presented with a hypothetical child who was 
required to remember a story, subjects were asked to indicate whether the 
story would be easier to learn by rote or by paraphrasing it. Through a series 
of questions subjects were awarded points (maximum= 11) for verbalizing an 
understanding of how recalling word for word is more difficult than recalling 
the gist, and for showing awareness of the advantages of the hypothetical 
character knowing the task requirements before choosing the method of 
study. 

To ascertain the reliability of the metamemory scoring system, 10% of the 
protocols were selected randomly and scored independently by the second 
author and another scorer. The two scorers arrived at the same score for each 
protocol with the exception of one on which they differed by one point. A copy 
of the scoring key may be obtained through the first author. 


Results and Discussion 
Experiment 1 


Employing the “central versus incidental memory” paradigm, this experi- 
ment used a group-administered recall-only task to test two hypotheses: 
1. that across groups, students instructed explicitly to memorize for later 
recall would perform better than those not instructed to do so, and 
2. that the average readers’ performance on the intentional memory task 
would be superior to their performance on the incidental memory task. 
The mean recall performance for each group of students under the two 
memory conditions are reported in Table 2. A 2 (memory type) x 2 (group) 
analysis of variance, with verbal IQ covaried out, produced a significant main 
effect for memory type (F=19.55, p<.001), no main effect for group (F=1.71, 
p=.19), and no interaction effect (F=0.34, p=.55). The results of the experi- 
ment confirmed our first hypothesis regarding the superiority of intentional 
memory over unintentional memory. Support was not found for the second 
hypothesis. The superior recall performance under the intentional condition 
is presumed to be the result of relatively more conscious strategic processing 
of information. Therefore, the absence of a significant interaction between 
memory type and group would appear to suggest that at least on the group 
administered task, poor and average readers did not differ in their manage- 
ment behavior. However, visual inspection of the means (Table 2) reveals 
that the average readers were generally stronger than the poor readers on 
the intentional task (difference in means=1.4) than they were on the unin- 
tentional task (difference in means=0.7). 
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Table 2 
Means and Sample Sizes for Average and Poor Readers by Memory Type 


Average Readers Poor Readers 
Intentional 12.22 10.82 
(n=11) (n=11) 
Unintentional 9.00 8.27 
(n= 14) (n= 12) 


Experiment 2 


In this experiment, we examined group differences in recall performance, 
strategy use, metamemory, and study time. Table 3 reports the means and 
standard deviations for each variable by group and the results of both multi- 
variate and univariate analyses of variance performed on the data. The 
analyses showed that average readers recalled significantly more items, 
utilized significantly more sophisticated strategies, and possessed richer 
metamemorial knowledge. The two groups of students did not differ sig- 
nificantly on study time, although average readers tended to spend more 
time studying the task than did poor readers. 

Even after covarying out the potential effects of the original verbal IQ 
differences, the average readers remained superior in both strategy use 
(F=5.23, p<.05) and metamemory (F=4.51, p<.05), although the two 
groups were no longer different in terms of recall performance. The finding 


Table 3 
Group Differences on the Four Dependent Measures' 


Average Readers Poor Readers 
Mean SD Mean SD Univariate F* 
Recall 10.6 2.5 9.2 2.1 4.51 * 
(10.5) (9.4) 
Strategy use 4.8 2.1 Biz 1.8 8.49 ** 
(4.7) (3.2) 
Metamemory 21.4 3.9 18.1 4.9 6;27.* 
(21.2) (18.2) 
Study time? 2.4 0.9 1.9 1.3 1.49 NS 
(2.3) (2.0) 


ee nn nnn EEEEEEEEEEEeeeneee 


Multivariate Fs 45 = 3.24; p<.05 

St OS eam O4 

‘Numbers in brackets are adjusted means after covarying out verbal IQ. 
2F-values, prior to covarying out verbal IQ. 

3Study time is reported in minutes. 
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that average and poor readers differed in strategic (management) behavior 
and metamemorial knowledge, even when they had been roughly equated on 
basic intellectual ability, suggests that inefficiency in strategy utilization and 
low metacognitive knowledge may be important ingredients of the under- 
achievement displayed by the poor readers. 

However, while the management hypothesis was confirmed statistically 
in this analysis, two observations emerging from our descriptive analysis cast 
some doubt on the true practical significance of that finding. The first 
observation is the strikingly low mean strategy use score for both average 
and poor readers. The maximum attainable strategy sophistication score was 
16, but even the more efficient strategy users had a mean score of 4.8. In both 
groups, the most frequently utilized strategy was unorganized rehearsal (1.e., 
repeated rehearsal of all 15 items together). Thirty-five percent of the poor 
readers and 56% of the average readers employed this strategy. Strategies 
considered to be superior to unorganized rehearsal (e.g., organization via 
conceptual categorization, meaningful association of items, elaboration, or 
chained rehearsal) were used by only 32% of average readers and 13% of poor 
readers. 

It must be noted, however, that the low level of strategy sophistication 
found in this study is consistent with the picture that emerges from develop- 
mental studies, namely, that children at or below 11 years of age generally 
employ less sophisticated strategies than older children or adults. For ex- 
ample, describing developmental trends in the use of strategies, Kail and 
Hagen (1982) noted that the first inkling of mature strategy use does not 
emerge until about age 10 years. On the other hand, there is extensive 
evidence to indicate that the less sophisticated rehearsal strategy is common- 
ly employed by younger children. For example, Flavell, Beach, and Chinsky 
(1966) found the use of rehearsal by 65% to 85% of 10-year-olds. Kail and 
Hagen’s (1982) review indicated that rehearsal becomes a common learning 
strategy at about 9 or 10 years. In the present study, the most commonly used 
strategy was rehearsal (46% of all subjects used it). 

The second noteworthy observation is that although none of the poor 
readers (and only four of the average readers) spontaneously employed cat- 
egorical organization to aid memorization, most subjects in both groups were 
able to sort the items into the three distinct categories when asked to do so 
during the postrecall probe session. In fact, the two groups did not differ on 
the categorization skill score derived from this probe. The means for this 
variable for the poor and average readers were 4.3 (SD=1.2) and 4.9 
(SD=1.5), respectively. Thus there was a clear case for production deficiency 
as a good explanation for children’s failure to use categorical organization in 
this study, but this case was as true of the average readers as it was of the 
poor readers. 


Interrelationships Among Recall, Strategy Use, and Metamemory 


Correlations among recall performance, strategy use, metamemory, and 
study time were calculated separately for the two groups. Among the poor 
readers, students utilizing more efficient strategies recalled more items 
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(r=.45, p<.05). Also, students who studied the task longer recalled more 
items (r=.55, p<.01). The strongest correlation found in this group was 
between strategy use and metamemory (r=.71, p<.001). Although positive 
correlations were found between recall performance and metamemory 
(r=.26) and between strategy use and study time (r=.25), these relationships 
were not significant. A larger number of intercorrelations was found in the 
average readers group. Recall performance correlated significantly with 
strategy use (r=.56, p<.01), metamemory (r=.71, p<.001), and study time 
(r=.38, p<.05) while strategy use correlated significantly with metamemory 
(r=.65, p<.001) and study time (r=.37, p<.05). 

Thus our hypothesis concerning the within-group interrelationships 
among the three main memory variables was confirmed wholly for average 
readers and partially for the poor readers. In both groups, students who 
employed more sophisticated strategies recalled more items, and students 
with superior metamemory utilized more sophisticated strategies. However, 
only in the average reader group was superior metacognitive knowledge 
significantly related to better recall. Also noteworthy was the difference 
between the two groups in the relationship between study time and strategy 
use. Compared with the poor readers, average readers appeared to have 
invested more of their study time in strategic activity. 

In order to go beyond the relationships reported for each group to make 
broader generalizations about the three-way relationships among recall, 
strategy use, and metamemorial knowledge, a separate analysis based on the 
total sample of 48 subjects was performed. Pearson product-moment correla- 
tions among the variables were computed, simultaneously partialling out the 
effects of verbal IQ and reading proficiency. The correlation coefficients for 
all three sets of relationships were statistically significant with the strongest 
correlation being that between strategy use and metamemory (r=.69, 
p<.001), followed respectively by recall and strategy use (r=.58, p<.001), 
and recall and metamemory (r=.54, p<.001). This ordering in the strength 
of correlations provides some support for aspects of Borkowski’s (Borkowski 
& Kurtz, 1984) theoretical formulations regarding relationships among the 
three variables. Borkowski has suggested that “metamemory should be link- 
ed more closely to strategic behavior than to recall accuracy ... because 
metamemory is a setting variable for strategy use” (p. 201). More important- 
ly, however, these correlations confirm the general theoretical proposition 
that metacognitive knowledge should be related not only to strategic be- 
havior but also to recall performance and are consistent with conclusions 
drawn recently by Schneider (1985) from his meta-analysis of extant re- 
search. 


Summary and Instructional Implications 
This study clearly confirmed the conceptualized superiority of intentional 
memory over unintentional memory, corroborated existing evidence that 
efficient strategic behavior is associated with better recall performance, and 
provided evidence that general metacognitive knowledge may be an impor- 
tant source of variation between average achievers and underachievers in 
reading. However, the evidence in support of the traditional management 
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hypothesis was rather weak. The evidence for strategic behavior differences 
between the average and poor readers was not as strong as the evidence for 
metamemorial knowledge differences. Not only did the average readers have 
superior metamemorial knowledge, but they appeared also to have been able 
to take better advantage of their general knowledge of memory in their recall 
performance than the poor readers. 

With regard to strategic behavior, perhaps what is insightful about this 
study lies more in the similarities between the two groups than in the 
differences. Not only was the level of strategy sophistication low among both 
groups, but average and poor readers alike manifested the classic production 
deficiency that has traditionally been associated with low ability and learning 
disabled students. Production deficiency refers to the situation whereby 
children who have the relevant prior knowledge (or strategy) often fail to use 
it to advantage on appropriate occasions (see Brown, 1987). One inference 
that can be drawn from these results is that the strategic behavior attributes 
described in the study appear to be more closely tied to age than to level of 
achievement. This inference is worth examining in future research; however, 
such research must ensure greater variability in level of achievement than is 
the case in the present study. It is conceivable that a comparison of poor 
readers with more-than-average readers would produce more dramatic 
strategic behavior differences. 

Finally, this study has demonstrated that regardless of real or potential 
between-group differences in recall performance, strategic behavior, and 
general metamemorial knowledge, the latter two are sources of significant 
individual differences in recall. 


Instructional Implications 


As indicated in the opening paragraph of this article, memory and general 
strategic behavior are core components of the learning process. Thus, while 
the experimental tasks employed in this investigation are not typical class- 
room tasks to the extent that the processes engendered by these tasks can be 
held to mirror general processes that children bring to cognitive tasks, the 
findings of this study have practical relevance to classroom strategy instruc- 
tion. In this section we present several instructional implications. The partly 
correlational nature of our data does not allow us to make strong inferences 
about how strategic and metacognitive processes are causally linked to aca- 
demic performance. However, the suggested applications are strengthened 
by the fact that corroboration for aspects of our findings exists in the ex- 
perimental literature. 
Enhancing performance through strategy training. Two of our findings 
underscore the importance of strategy training: 
1. that across groups the intentional memory condition produced better 
recall performance than the unintentional recall condition, and 
2. that students who used more sophisticated strategies also recalled more 
items. 
Seen in relation to the view that cognitive strategies are a learnable—and 
hence a teachable—capability (Gagne, 1985; Marfo, Mulcahy, Peat, & 
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Andrews, in press), these findings suggest that one way to improve learning 
and performance in school is to include strategy instruction in the cur- 
riculum. The focus and purpose of such explicit instruction in cognitive 
strategies will be to enable students to consciously devise and use strategies 
when confronted with cognitive tasks. A discussion of the how of teaching 
strategies is beyond the purview of this paper; one potentially viable ap- 
proach to strategy instruction is described in detail by Mulcahy, Marfo, and 
their associates (Marfo et al., in press; Mulcahy & Marfo, 1987; Mulcahy, 
Marfo, Peat, & Andrews, 1986; Mulcahy, Marfo, Peat, Andrews, & Clifford, 
1986). 

The relevance of strategy instruction cuts across ability and achievement 
levels. A second noteworthy implication emerging from this study is that 
explicit strategy training is necessary not only for underachievers but also for 
average achievers. Recall that the correlation between strategy use and recall 
performance was positive and significant for both groups of students, and 
both groups of students displayed a relatively low level of sophistication in 
strategy use. This is not to suggest that students at different ability or 
achievement levels have similar strategy instructional needs. For example, 
our results show that the average readers may have invested more of their 
study time in strategic pursuits than the poor readers. Thus, while all stu- 
dents can benefit from strategy instruction, the focus and nature of instruc- 
tion will have to vary from student to student. 

Embedding metacognitive activities in strategy instruction may enhance 
the outcomes of such instruction. Third, from the strong correlations between 
metacognitive knowledge and strategy use, it appears that embedding meta- 
cognitive activities in strategy instruction could produce even better results. 
That is, providing information about how and why specific strategies may be 
appropriate or inappropriate for specific tasks may serve to enhance the 
spontaneous generation and/or use of strategies. Support for this recommen- 
dation comes also from the work of Pressley and his associates (e.g., Pressley, 
1986; Pressley, Borkowski, & O’Sullivan, 1984) who have examined the role 
of metacognitive acquisition procedures in strategy learning and generaliza- 
tion. 


Note 


1. This probe required subjects to indicate whether it would have been easier to memorize 
the pictures if the items were presented in a categorized format. 
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Effectiveness of Teacher-Administered 
Tests and Rating Scales in Predicting 
Subsequent Academic Performance’ 


Early screening is necessary to facilitate prompt intervention with respect to 
learning difficulties. Thus it is of practical value to investigate the efficacy of tests 
and rating scales which teachers can administer, as compared with individual 
psychological assessments, in screening for learning difficulties. To this end, 197 
kindergarten children were assessed on an extensive battery of individually ad- 
ministered assessment instruments including the Stanford-Binet and Metropoli- 
tan Readiness Tests. Teacher assessments involved the Davids and Myklebust 
rating scales for pupil behavior. Follow-up assessment of the children’s achieve- 
ment was completed at the end of grade 1. Indications are that the best predictors 
of grade 1 achievement are the Metropolitan Readiness Tests and the Myklebust 
Pupil Rating Scale. This offers support for the view that appropriate teacher 
rating scales and group tests which teachers administer can effectively predict 
subsequent academic performance and encourages further study of the potential of 
the classroom teacher as the primary screening agent for learning difficulties. 


The early identification of children with learning difficulties is a recurring 
theme in psychological and educational research. Karly detection of school 
learning problems is a challenge, but chief among the benefits is the avoid- 
ance of needless failure and the concomitant psychologically maladaptive 
behaviors that compound the complex learning situation (Casto & 
Mastropieri, 1986; Lazar & Darlington, 1982). Clay (1985) has stated her 
support for early identification of learning problems, specifically reading 
disabilities. She outlines four major problems resulting from delayed 
remediation: a great gap or deficit to be made up; consequential deficits in 
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other areas of education; consequences for the child’s personality and con- 
fidence; and the practice over a period of time of incorrect behaviors which 
must be unlearned before appropriate strategies can be taught. 

Previous research efforts to develop an effective index for predicting 
school learning difficulties have emphasized the use of psychometric testing 
procedures assessing numerous aspects of general intellectual ability as well 
as specific skills related to classroom functioning. It is recognized that some 
variance in grade 1 achievement is accounted for by differences in cognitive 
ability (Gullo, Clements, & Robertson, 1984; Tollefson, Rodriguez, & Glaz- 
zard, 1985). Deficiencies may occur in coordination, language, memory, 
visual and auditory discrimination, and sequencing (Schmidt & Perino, 1985; 
Swanson & Mason, 1984; Tollefson et al., 1985). Specific preacademic skills 
such as knowledge of the alphabet, numbers and colors, the ability to print 
one’s first name, and recognition of various simple geometric shapes are also 
associated with success in grade 1 (Barbe & Shields, 1976). The manner in 
which students approach activities, their organization, motivation, task 
orientation, interest, and perseverance also emerge as factors related to 
school performance (Hightower et al., 1986). In addition, school performance 
is related to the manner in which children interact socially and their ability 
to follow classroom rules and conform to behavioral requirements (Cooper & 
Farran, 1988). 

The assumption has been that individual assessments by highly trained 
specialists are necessary to obtain accurate information regarding these 
factors. Indeed, while such procedures have proven to be of reasonably high 
predictive value, their cost in terms of time, money, and labor makes such 
examinations impractical as primary screening tools (Ysseldyke et al., 1983). 
Recent thinking in school systems recognizes that instructional decision 
making relies on a teacher’s ongoing evaluation of individual student needs 
(Gerber & Semmel, 1984; Shinn, Tindal, & Spira, 1987). It is suggested that 
specialists and consultants would provide a more valuable service by assist- 
ing with program development, empowering teachers to use their observa- 
tional skills effectively, and entrusting them with the responsibility of 
identifying those students at risk for developing problems with achievement 
in school. 

Findings from research focusing on the teacher’s role in screening for 
learning problems have varied. Fletcher and Satz (1984) and Stevenson, 
Parker, Wilkinson, Hegion, and Fish (1976) reported that only about 20% of 
those children who subsequently experienced severe learning difficulties 
were correctly identified by their preschool teachers as children in need of 
special assistance when global descriptions were involved. However, when 
checklists or rating scales which included items related to the criterion 
performance (i.e., school achievement) were included, predictions in the 
range of 70 to 85% accuracy were made (Becker & Snider, 1979; Lindsay & 
Wedell, 1982; Simner, 1986). This is not surprising when one considers that 
the teacher is aware of the child’s functioning levels over an extended period, 
that his or her examination does not rely on a child’s cooperation during a 
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single, specific testing session, and the child’s environment is not disrupted 
by removal from the classroom to obtain the information. 

As it would be reasonable to institute an early school screening program 
involving teacher administered tests and teacher ratings of children’s learn- 
ing characteristics, and the predictive potential of some rating scales such as 
the Myklebust Pupil Rating Scale (e.g., Doherty & Margolis, 1981) appears to 
be good, it would be valuable to investigate the efficacy of teacher assess- 
ments compared with individual psychological assessments in screening for 
learning problems. 

The present research project attempted to obtain information on the 
effectiveness of rating scales and tests which can be efficiently administered 
by classroom teachers in predicting subsequent academic performance. Such 
research is preliminary to studying the potential of classroom teachers as 
primary screening agents for learning difficulties. 


Method 
Participants 


The sample was drawn from four schools in the public school system, each 
having large populations considered to be representative of the range of 
socioeconomic levels in the city of Edmonton, Alberta. Using the Blishen 
Socio-Economic Index for Occupations in Canada (Blishen & Roberts, 1976), 
the mean family socioeconomic status (SES) rating of the sample in kinder- 
garten was 4.4 (where a rating of 1 is very high and 6 is very low). This is 
consistent with the norms which show a predominance of lower ratings (4 to 
6) in the general Canadian population. Parental permission was obtained to 
assess 200 children, 5 to 6.5 years of age (mean = 67.0 months; SD = 3.8 
months) from 10 kindergarten classes. The sample was intended to be a 
normal distribution of children, not a sample of children with learning 
problems. Three of the children were not assessed as one was considered too 
young and two others were found to have been of very low birth weight 
(<1500 grams), which would have introduced a potential intervening vari- 
able. Thus the final kindergarten sample consisted of 197 children, 104 girls 
and 93 boys. Of that group, 171 children were still residing in the city one 
year later and were assessed again at that point. 


Instruments 


The length of the comprehensive, individually administered, kindergarten 
test battery was limited by the need to confine testing to one session and to 
avoid undue fatigue or distress to the children. The measures were chosen to 
assess levels of functioning in five areas considered to be predictive of learn- 
ing disabilities, namely, intelligence, attention deficit behavior, linguistic 
processes, visual processes, and memory (Meyer, 1988). 

The kindergarten battery included the following tests: Stanford-Binet 
Intelligence Scale (SBIQ, Form LM), selected as having norms that extended 
below five years of age; Metropolitan Readiness Tests (MRT, Form P, Level 
1), chosen to measure preacademic skills of auditory memory, rhyming, letter 
recognition, visual matching, school language and listening, and quantitative 
language; Peabody Picture Vocabulary Test (PPVT) of receptive vocabulary; 
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and Developmental Test of Visual-Motor Integration (DTVMI), assessing the 
integration of visual perception and motor coordination. These tests are 
considered to be among the more reliable and valid tests available for kinder- 
garten assessment batteries (Salvia & Ysseldyke, 1988). The MRT is design- 
ed to be administered by teachers as a group test whereas the other three 
tests are intended for individual administration by trained personnel. The 
Davids Rating Scale for Hyperkinesis (DRS), a measure of behavior problems 
for children at risk for educational difficulties, and the Myklebust Pupil 
Rating Scale (PRS), an instrument for learning disabilities screening, both 
rating scales designed for teacher use, were also part of the battery. 

The grade 1 achievement assessment consisted of the Canadian Test of 
Basic Skills (CTBS, Level 6, Form 5), including subtests of listening, vo- 
cabulary, word analysis, reading, language, and mathematics. (In discussion 
of the results, CTBS will denote the composite percentile score for the test.) 
As well, teachers were asked to rate the children on reading, mathematics, 
oral language, and vocabulary using a simple three-point scale, where a score 
of 1 was below average, and 3 was above average. These four scores were 
summed to obtain a composite teacher rating (Rtot) for each child. 


Procedure 


All children were assessed individually, each in a two-hour testing session in 
their respective schools during April and May of their kindergarten year. All 
testing and scoring was done by the first author and three research assistants 
trained by the first author. When all assessments were completed, classroom 
teachers rated each child on the two rating scales (DRS and PRS) under the 
direction of the first author who recorded all teacher responses. 

In May of the following year the CTBS was administered to all children, 
individually or in groups depending on the numbers of children in particular 
schools, by two research assistants who also did all scoring. As well, teachers 
completed the brief rating scale previously described for each child. 


Results 
While the total number of children assessed at kindergarten level was 197, 
the maximum number of cases for kindergarten variables varies from 189 to 
197 due to some missing data on one or more tests in the battery. The mean 
(M) and standard deviation (SD) for each variable are reported in Table 1. 
Note that the sample performs above the normative mean score on both the 
SBIQ and PPVT but below the normative mean score on the MRT. 

At the grade 1 level, 171 students (94 boys and 77 girls) remained in the 
sample, 26 having moved out of the city. Mean age at grade 1 testing was 79 
months (SD = 3.8 months). Table 1 also reports mean performance (and 
SDs) on the grade 1 tests and ratings. Mean performance for the group on the 
CTBS was 42.0, below the mean percentile for the test’s normative distribu- 
tion but in the average range. 

At both kindergarten and grade 1 levels, the sample appeared to fit the 
description of a “normal” distribution, with more children scoring in the 
average range and fewer at the ends of the distribution. 
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Table 1 
Means, SDs, and Ns for Standardized Tests and Rating Scales 
Administered in Kindergarten and Grade 1 


n M SD 
Kindergarten (N=197) 
Stanford-Binet IQ Score 197 107.6 14.1 
Metropolitan Readiness Composite Percentile 197 36.7 23.2 
Peabody Picture Vocabulary Test Standard Score 197 107.4 16.3 
Beery DTVMI Transformed Score (MA-CA + 100) 197 97.0 8.2 
Davids Hyperkinesis Rating Scale Total Score 189 led 5.8 
Myklebust Pupil Rating Scale Total Score 189 75:7 18.7 
Grade 1 (N=171) 
CTBS Composite Percentile Score (CTBS) 171 42.0 26.5 
Composite Mean Teacher Rating (Rtot) 171 2.0 0.6 


Intercorrelations between tests and ratings for kindergarten and grade 1 
data are reported in Table 2. Note that the strongest intercorrelations were 
between the MRT and SBIQ (.60), MRT and PRS (.57), MRT and CTBS (.66), 
SBIQ and CTBS (.64), and the PRS and CTBS (.56). A stepwise regression 
analysis indicated that the best predictors of grade 1 CTBS were the MRT 
(R=.66), MRT and SBIQ (R=.73), and MRT, SBIQ, and PRS (R=.75). No 
other variables met the .05 significance level for entry into the model. 

Table 2 also indicates that for grade 1 teacher ratings (Rtot) the strongest 
intercorrelation was between MRT and SBIQ and Rtot (.60 and .51, respec- 
tively), but the intercorrelation between the PRS and Rtot was somewhat 
lower (.42). A stepwise regression analysis indicated that the best predictors 
of teacher composite ratings for grade 1 were the MRT (R=.61) and MRT 
and SBIQ (R=.63), with no other variables meeting the .05 significance level 
for entry into the model. 


Table 2 
Intercorrelation Matrix: Kindergarten Tests and Teacher Rating Scales 
with Grade 1 CTBS Composite Score (CTBS) and 
Teacher Rating Total Score (Rtot) 


MRT SBIQ. PPVT ~~ DTVMI DRS PRS CTBS_Rttot 
MRT 60 46 25 ~15 57 66 60 
SBIQ 51 33 -06 44 64 51 
PPVT 04 -06 35 39 26 
DTVMI ~04 19 21 29 
DRS -55 4 -06 
PRS 56 42 
CTBS 74 
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A series of discriminant function analyses was also performed in an 
attempt to determine the extent to which the psychoeducational tests and 
behavior ratings for the sample of students in kindergarten would differen- 
tiate two identified subgroups of these children one year later. 

It would seem reasonable to expect the kindergarten test results to predict 
accurately those children who stand out as being above or below average in 
grade 1 performance, thus potentially needing special attention in the form 
of enrichment or remediation. Two ways were available to classify the child- 
ren in this manner: (a) those who scored in the upper (25 students; 15% of 
sample) and lower (48 students; 28% of sample) quartiles on the CTBS in 
grade 1 (N=171); and (b) those who obtained below average (<4) (n=48; 
28%) versus above average (>8) (n=50; 30%) total teacher ratings. 

For the CTBS classification, all the tests in the battery (MRT, SBIQ, 
PPVT, DTVMI, DRS, and PRS) correctly predicted group membership in the 
upper and lower quartiles for 93.2% (68) of the 73 students so classified by 
the CTBS. The MRT and PRS correctly classified 90.4% (66), and the PRS 
alone correctly classified 86.3% (63). Thus the teacher rating scale alone was 
less accurate than the full battery, but the two instruments which can be 
teacher administered (MRT and PRS) were almost as good as the full battery 
for predicting group membership so defined. Thus little accuracy is lost in 
moving from the six-predictor to the two-predictor system. 

For classification on the basis of high or low grade 1 teacher ratings, the 
MRT and PRS were only slightly less accurate (78.6%, or 77 of 98 students) 
than the full battery (81.6%, or 80 of 98 students), in correctly predicting 
group membership. The PRS alone accurately predicted group membership 
for 74.5% (73 students). 

Thus, for predicting the two classifications of extreme groups, the pupil 
behavior rating scale (PRS) alone was least accurate, but when coupled with 
the MRT, a second test which can be teacher administered, was about as 
accurate as the total battery used in this study. Of the two classification 
systems, that using the CTBS is likely the more accurate measure of grade 1 
performance, as it involves a standardized achievement test rather than a 
simple three-point teacher rating scale. 


Discussion and Conclusions 
Taking into consideration the intercorrelational data, stepwise multiple re- 
gression analyses, and particularly the discriminant function analyses, it is 
apparent that the MRT and PRS, both of which can be teacher administered, 
are among the best predictors of grade 1 CTBS and grade 1 teacher ratings. 
This finding is consistent with the literature on predictive usefulness of these 
two tests (e.g., Doherty & Margolis, 1981; Gullo et al., 1984) and usefulness 
of teacher administered tests as primary screening tools (e.g., Gerber & 
Semmel, 1984). The fact that the prediction of global teacher ratings was less 
accurate than predication of CTBS groups is likely a consequence of lower 
reliability of the global ratings. Interestingly enough, on the predictor side of 
the equation the PRS, which is also a rating scale, did not seem to suffer the 
same problem. However, it has been shown (e.g., Fletcher & Satz, 1984; 
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Simner, 1986) that global teacher ratings are less reliable than teacher 
checklists or rating scales. 


At the grade 1 level, and particularly for the CTBS, the two tests which 
can be teacher administered are almost as good as the larger battery in 
predicting grade 1 outcome. This study suggests that well-chosen teacher 
administered tests and rating scales at kindergarten may have good potential 
as an efficient and effective screen for children in need of early remedial 
assistance. Note that the MRT and PRS represent a fairly comprehensive 
assessment of skills relevant to predicting learning disabilities (Meyer, 1988) 
and were clearly superior in accuracy to such tests as the DRS and DTVMI, 
which are also designed for individual assessments by trained teachers. 
Clearly, the accuracy of the teacher as a primary agent in screening for 
learning problems (e.g., Gerber & Semmel, 1984) is contingent on using 
appropriate screening instruments. 

Use of the teacher as the primary screening agent (Gerber & Semmel, 
1984; Shinn et al., 1987) warrants further examination as it offers potential 
to alter the nature of the school psychologist’s role to one of program consult- 
ation and planning rather than one focused on assessment of learning 
problems (Ysseldyke et al., 1983). 


Note 


1. An earlier version of the paper was presented at the Third Annual Conference of the 
Canadian School Psychology Association, Edmonton, June 1989. The writers express 
their appreciation to L. Kvemshagen, T. Hauck, S. Yee, J. Morin, and L. Narain for their 
research assistance. The research was partially funded by a Support for the Advancement 
of Scholarship grant to the second author from the Faculty of Education, University of 
Alberta. 


References 


Barbe, W.B., & Shields, B.P. (1976). Reading skills check list and activities: Readiness level. 
West Nyack, NY: Centre for Applied Research in Education. 

Becker, L.D., & Snider, M.A. (1979). Teachers’ ratings and predicting special class placement. 
Journal of Learning Disabilities, 12, 96-99. 

Blishen, B., & Roberts, H.A. (1976). A revised socio-economic index for occupations in 
Canada. Canadian Review of Sociology and Anthropology, 13, 71-79. 

Casto, G., & Mastropieri, M.A. (1986). The efficacy of early intervention programs: A 
meta-analysis. Exceptional Children, 52, 417-424. 

Clay, M. (1985). The early detection of reading difficulties (3rd ed.). Portsmouth, NH: 
Heinemann. 

Cooper, D.H., & Farran, D.C. (1988). Behavioral risk factors in kindergarten. Early 
Childhood Research Quarterly, 3, 1-19. 

Doherty, M., & Margolis, H. (1981). The effectiveness of Myklebust’s Pupil Rating Scale as an 
indicator of learning difficulties. Reading Improvement, 18, 157-160. 

Fletcher, J.M., & Satz, P. (1984). Test-based versus teacher-based predictions of academic 
achievement: A three year longitudinal follow-up. Journal of Pediatric Psychology, 9, 
193-203. 

Gerber, M., & Semmel, M. (1984). Teacher as imperfect test: Reconceptualizing the referral 
process. Educational Psychologist, 19, 137-148. 

Gullo, D.F., Clements, D.H., & Robertson, L. (1984). Prediction of academic achievement with 
the McCarthy Screening Test and Metropolitan Readiness Tests. Psychology in the 
Schools, 21, 264-269. 

Hightower, A.D., Work, W.C., Cowen, E.L., Lotyczewski, B.S., Spinell, A.P., Guare, J.C., & 
Rohrbeck, C.A. (1986). The teacher-child rating scale: A brief measure of elementary 


263 


M. Meyer, L. Wilgosh, and H.H. Mueller 


children’s school problem behaviors and competencies. School Psychological Review, 15, 
393-409. 

Lazar, I., & Darlington, R. (1982). Lasting effects of early education: A report from the 
consortium for longitudinal studies. Monograph of the Society for Research in Child 
Development, 47, No. 195. 

Lindsay, G.A., & Wedell, K. (1982). The early identification of educationally at risk children 
revisited. Journal of Learning Disabilities, 15, 212-217. 

Meyer, M. (1988). A psychoeducational evaluation of precursors to learning disabilities in the 
very low birthweight population at 5‘/2 years. Unpublished doctoral dissertation, 
University of Alberta. 

Salvia, J., & Ysseldyke, J. (1988). Assessment in special and remedial education (4th ed.). 
Boston, MA: Houghton Mifflin. 

Schmidt, S., & Perino, J. (1985). Kindergarten screening results as predictors of academic 
achievement, potential, and placement in second grade. Psychology in the Schools, 22, 
146-151. 

Shinn, M., Tindal, G., & Spira, D. (1987). Special education referrals as an index of teacher 
tolerance: Are teachers imperfect tests? Exceptional Children, 54, 32-40. 

Simner, M.L. (1986). Predictive validity of the Teacher’s School Readiness Inventory. 
Canadian Journal of School Psychology, 3, 21-32. 

Stevenson, H.W., Parker, T., Wilkinson, A., Hegion, A., & Fish, E. (1976). Longitudinal study 
of individual differences in cognitive development and scholastic achievement. Journal of 
Educational Psychology, 68, 377-400. 

Swanson, B.B., & Mason, G.E. (1984). Measuring early reading progress: Some non- 
traditional measures. Journal of Educational Research, 78, 90-96. 

Tollefson, N., Rodriguez, R., & Glazzard, P. (1985). Predicting reading achievement for 
kindergarten boys and girls. Psychology in the Schools, 22, 34-39. 

Ysseldyke, J., Thurlow, M., Graden, J., Wesson, C., Algozzine, B., & Deno, S. (1983). 
Generalizations from five years of research on assessment and decision-making: The 
University of Minnesota Institute. Exceptional Education Quarterly, 4, 75-93. 


264 


The Alberta Journal of Educational Research Vol. XXXVI, No. 3, September 1990, 265-295 


NEIL A. JOHNSON 


University of New South Wales 


and 


EDWARD A. HOLDAWAY 


University of Alberta 


School Effectiveness and Principals’ 
Effectiveness and Job Satisfaction: 
A Comparison of Three School Levels 


Educational research has been criticized for neglecting comparative studies of 
specific types and levels of school organizations. This article compares findings 
from parallel studies in elementary, junior high, and senior high schools in 
Alberta. The studies investigated levels and important determinants of school 
effectiveness, job satisfaction of principals, and the effectiveness of principals. 
Principals at all three levels expressed moderate satisfaction overall and satisfac- 
tion with their working relationships at school; satisfaction with district rela- 
tionships, workloads, inservice, and remuneration, however, varied with school 
level. While staff-related matters were most important for principals’ satisfaction, 
several facets influenced overall job satisfaction. Schools at all levels were re- 
garded as moderately effective overall and effective on most conventional criteria. 
Community liaison and extracurricular provision were among the dimensions 
where school-level discrepancies occurred. Principals at each level also rated 
themselves as moderately effective overall and most effective in decision making 
and creating a positive working environment; major school-level differences con- 
cerned principals’ adaptability and influence on staff improvement. Teachers and 
area superintendents rated effectiveness lower, highlighting the need for multiple- 
constituency studies. The importance attached to many criteria showed effective- 
ness to be a more complex phenomenon than some convenient school effectiveness 
checklists suggest. More expansive perspectives are recommended for studying 
and diagnosing effectiveness levels in schools. 


Most educational studies and theorizing to date have been pursued with only 
perfunctory acknowledgment of the distinctive qualities of various school 
levels, types, and environments. Such an approach is consistent with the 
theory movement in educational administration, which was founded on a 
belief in a universally applicable theory of organizational behavior. However, 
many scholars now assert the need to examine systematic variations in 
administration and work in educational settings of different kinds, and 
Griffiths’ (1988) recent admission is indicative of a change of attitude among 
former proponents of a widely generalizable administrative theory: “In fact, 
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it would seem that it is necessary to have theories specific to particular types 
of educational institutions” (p. 48). 

One patently logical basis for classifying school organization types for the 
purpose of theoretical argument and research is by educational level, because 
administrators, educators, and other participants in schools of different 
levels seem to display many disparate concerns and attributes. This article 
compares findings from three recent studies in elementary schools (Johnson, 
1988), junior high schools (Sroypan, 1988), and senior high schools (Gunn, 
1984; see also Gunn & Holdaway, 1985, 1986). Similar methods were used in 
the three studies to elicit perceptions from principals and other educational 
constituencies in connection with school effectiveness, the effectiveness of 
principals, and the job satisfaction of principals. The purpose of this com- 
parison is to identify points of perceptual consistency and departure to assist 
in advancing the development of theory relating to specific educational 
levels. 

Although few researchers have systematically examined organizational 
peculiarities and commonalities among elementary, junior high, and senior 
high schools, some pertinent observations have been made. In general terms, 
a review of limited available research led Hallinger and Murphy (1987) to 
posit several probable differences between elementary and secondary 
schools; these differences related to organizational goals, administrative 
structure, characteristics of teachers and students, curriculum organization 
and instruction, and links with parents and the community. Regarding dif- 
ferences among teaching practices, Holdaway and Millikan (1980) reported 
more specifically on a tendency for elementary school teachers to work more 
collegially than junior and senior high school teachers, particularly with 
regard to soliciting advice and assistance about instruction. Other writers 
(e.g., Burlinghame, 1988; Kratzmann, Byrne, & Worth, 1980) have noted 
that classroom management is a more taxing and more central function for 
teachers of adolescent students. 

Findings from two studies of educators’ job satisfaction are relevant. 
Holdaway’s (1978) Alberta research revealed important differences in the 
nature of satisfaction derived by elementary, junior high, and senior high 
school teachers. Overall job satisfaction was greatest at the elementary level, 
and lowest among junior high school teachers. Satisfaction with salary, 
sabbatical leave arrangements, involvement in decision making, quality of 
substitute teachers, ability of students, societal attitudes, and inservice edu- 
cation decreased with increasing grade level. In the United States, Mac- 
Queen and Ignatovich (1986) also identified statistically significant school- 
level differences in teachers’ overall job satisfaction, as well as in satisfaction 
with teacher-student interaction, teaching assignment, teaching achieve- 
ment, and workload. Elementary school teachers were more satisfied overall 
and with most job facets, although junior and senior high school teachers 
were relatively more satisfied with their workloads. The United States study, 
however, was hampered by poor survey response rates. Moreover, neither 
study examined the job satisfaction experienced by school administrators. 
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Although based on small samples, two sequential observational studies of 
high school and elementary school principals in the United States (Kmetz & 
Willower, 1982; Martin & Willower, 1981) provided some insight into distinc- 
tive and common attributes of the principalship at different levels. While 
principals at the two levels spent similar proportions of time on organization- 
al maintenance tasks and pupil control, curricular and instructional matters 
absorbed much more time for elementary school principals. As Kmetz and 
Willower added, the elementary principals “had more contacts with superiors 
and with parents, and spent more time on the schools’ instructional pro- 
grams and less on extracurricular activities” (p. 73). 

Other commentators (e.g., Hallinger & Murphy, 1987; Jones, 1988) have 
also remarked how instructional leadership, an alleged but ephemeral at- 
tribute of effective principals, is effected differently at the two levels: Ele- 
mentary principals tend to be personally involved in planning and 
supervising instruction, whereas secondary school principals, who are con- 
strained by larger enrollments as well as subject matter specialization and 
multilevel organizational structure, delegate leadership responsibilities and 
exert influence indirectly and symbolically. 

Such differences in effective leadership behavior may be attributable to 
structural disparities between school levels. In an important survey study 
spanning three school levels (Herriott & Firestone, 1984), teachers in ele- 
mentary schools exhibited consistently high goal consensus and central- 
ization, whereas senior high schools had low ratings on both variables; junior 
high schools scored between the two extremes. These researchers speculated 
about likely reasons for the disparity and advanced the following: influences 
related to students’ differing ages, the comparative diversity of curricula in 
secondary schools, greater staff specialization and formal departmentaliza- 
tion in secondary schools, as well as school-level differences in school size, 
sex, and experience of staff, and extracurricular programs. Herriott and 
Firestone were unwilling to label elementary schools as rational bureau- 
cracies and senior high schools as loosely coupled systems, but proposed these 
extremes as marked characteristics of the two levels. 

The few comparative studies of job satisfaction in schools, aspects of the 
principalship, and related issues all suggest pervasive differences in the 
work, experiences and attitudes of elementary, junior high, and senior high 
school personnel. Hence the description of “the distinction between elemen- 
tary, junior high, and senior high schools” as “one of the more prominent 
characteristics of American public schools” (Herriott & Firestone, 1984, p. 
49) is understandable. In view of such marked differences in organizational 
life and administration at different school levels, Griffiths’ (1988) support for 
theories for each organization type is defensible, and Holdaway and 
Millikan’s (1980) call for “more data ... concerning practices in different 
settings, that is in classrooms, schools and school systems of different type, 
size and location” (p. 209) seems warranted. Similar recommendations are 
found in the work of Hallinger and Murphy (1986), Johnson (1987), Jones 
(1988), and Purkey and Smith (1983), and Kmetz and Willower’s (1982) 
replicatory study likewise prompted them to “underscore the importance of 
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cumulative and comparative work” (p. 76) that probes differences among 
schools of different levels. 

It is surprising, then, that relatively little attention has been devoted to 
systematic development of theory and research relating to specific education- 
al levels, in connection with job satisfaction, school effectiveness, or indeed 
most other aspects of educational organization. For example, Daresh and Liu 
(1985) noted the elementary school bias in studies of effective schools and 
were forced to conclude from apparent school-level differences that it is 
difficult to apply the findings from that body of research literature. They 
recommended that “the factors that characterize effective elementary 
schools may indeed be relevant for the analysis of secondary schools, but it is 
clear that more studies of school effectiveness and instructional leadership 
must be directed toward middle, junior, and senior high schools” (pp. 1-2). 
Hallinger and Murphy (1987) and Jones (1988) also expressed this concern. 

Aspects of the work and effectiveness of principals at different levels seem 
to represent a particular void in extant understanding: Leithwood’s (1986) 
review of empirical research literature yielded only 10 studies that explicitly 
compared the work of elementary and secondary school principals, and he 
concluded that “the results of these studies were sufficiently tentative to 
make them an unreliable basis for either understanding or action” (p. 100). 
There remains a clear need for concerted appraisal of school-level differences 
in school administration, organization, and attitudes in a variety of locations. 

The program of three studies referred to in this article fulfilled this intent. 
The three studies were conducted in the province of Alberta to examine and 
compare the perceptions of elementary, junior high, and senior high school 
principals in relation to their levels of job satisfaction, the effectiveness of 
their schools, and their effectiveness as leaders. The initial study (Gunn, 
1984) also dealt with perceptions of the bases of principals’ influence, but this 
aspect was discarded in favor of more extensive investigation of the satisfac- 
tion and effectiveness issues in the two comparative studies that followed. 

While the focus of this article is on school-level comparisons, the substan- 
tive emphases of the three studies also deserve comment. 

First, there is now a well established literature on school effectiveness, 
particularly in relation to United States schools. Pervading that literature 
(e.g., Edmonds, 1979; Fullan, 1985; Lezotte, 1986; Purkey & Smith, 1982, 
1983) are a few variables alleged to be core components of effectiveness. 
Edmonds’ early commentary on school effectiveness variables is represent- 
ative; he included (a) high expectations of instructional effectiveness among 
staff, (b) strong leadership by the principal, (c) an orderly, quiet, and work- 
oriented atmosphere, (d) an emphasis on academic activities and develop- 
ment, and (e) frequent assessment of students. However, critics of this 
mutually confirming body of research and theoretical writing (e.g., Clark, 
Lotto, & Astuto, 1984; Good & Brophy, 1986; Mackenzie, 1983; Rowan, 
Bossert, & Dwyer, 1983) have noted fundamental research and interpreta- 
tion problems such as an undue reliance on achievement tests to measure 
effectiveness, overreliance on elementary schools in studies, unjustified as- 
sumptions about causality of effectiveness variables, neglect of interschool 
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differences during aggregation of data, and assumptions that findings are 
generalizable to other locations. Some educational writers (e.g., Clark et al., 
1984; Hoy & Ferguson, 1985; Ratsoy & Miklos, 1985) have therefore ad- 
vanced a broader range of criteria for assessing effectiveness, taking into 
account educational goals as well as the quality of organizational inputs, 
processes, and other beneficial outcomes. Similarly, writers on program 
evaluation (e.g., Nevo, 1983) have stressed the importance of appraising the 
quality of educational program design and processes, as well as outcomes. 
These expansive views of school effectiveness are more in accordance with 
the perspectives of organizational effectiveness writers from other manage- 
ment fields (e.g., Cameron, 1986; Cameron & Whetten, 1983; Steers, 1977). 
In view of this disjunction of perspectives on effectiveness, a major purpose 
of the studies described below was to examine the performance of schools on 
a wide range of criteria such as official and operative organizational goals, 
satisfaction and other desirable characteristics of staff members, leadership 
effectiveness, appropriate organizational structure, climate and work tech- 
nology, as well as interactions between the organization and the wider polit- 
ical and social environment. The relevance of those criteria for assessing the 
effectiveness of schools in Alberta was also examined. 

The second substantive matter worthy of mention is the choice of effec- 
tiveness and job satisfaction as variables for the three studies. It is common 
to link school effectiveness with the principal’s effectiveness in emphasizing 
goals, exercising instructional leadership, and promoting collegial work (e.g., 
Bossert, Dwyer, Rowan & Lee, 1982; Duignan, 1985; Johnson & Snyder, 
1985; Little, 1982; Rosenholtz, 1985). Organizational effectiveness writers 
(e.g., Cameron & Whetten, 1983; Mott, 1972; Steers, 1977) have highlighted 
also the leader’s effectiveness in decision making and establishing open 
communication, adaptability, and flexibility in the organization. With regard 
to job satisfaction research, investigators in fields other than education (e.g., 
Lawler & Porter, 1967, 1976) have tried to identify a potential role of 
satisfaction in enhancing the productivity of workers; and organizational 
effectiveness researchers (e.g., Steers, 1977) have tended to treat job satisfac- 
tion as one of the dimensions of effective organizational performance. Of 
course, some writers (e.g., Brayfield & Crockett, 1970; Gross & Etzioni, 
1985; Kelly, 1980) have disputed claims of any substantial and systematic 
link between job satisfaction and performance. Despite the importance and 
extent of this research and debate, the possibility of a relationship between 
job satisfaction and effectiveness seems to have attracted little attention 
from researchers in educational administration. Accordingly, an investigati- 
on in connection with effectiveness and the satisfaction of school principals 
was warranted. 


Methods Used for the Studies 
Gunn (1984) elicited questionnaire responses from 133 (85.8%) of the 155 
senior high school principals (grades 10-12) in Alberta. Using a scale of 1 
(highly dissatisfied) to 6 (highly satisfied), respondents rated their overall job 
satisfaction and satisfaction on 35 individual work facets, the overall effec- 
tiveness of their schools and effectiveness on an array of nine organizational 
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performance criteria, and their overall effectiveness as leaders as well as 
their effectiveness on nine criteria commonly associated with effective school 
leaders. Subsequent interviews with 10 senior high school principals con- 
firmed and probed some of the major findings. 

Subsequently, two parallel studies were conducted in elementary schools 
(Johnson, 1988) and junior high schools (Sroypan, 1988). Pilot testing of 
instruments and interview methods for those studies called into question the 
need for a scale with six alternatives for rating satisfaction and effectiveness; 
however, in the light of the foregoing review, the benefit available from direct 
comparison of elementary, junior high, and senior high school findings jus- 
tified the retention of Gunn’s (1984) six-point scale. This decision to pursue 
comparative research did not preclude considerable expansion of the inquiry 
in other ways. 

Preparation for the elementary school study included reviews of research 
literature on effectiveness and job satisfaction, exploratory interviews, and 
two pilot studies of a greatly expanded version of Gunn’s (1984) survey 
instrument. Questionnaires were then distributed to 131 principals of ele- 
mentary schools (grades k-6) in Alberta—a 20% random sample of the 654 
principals in the province. Responses were received from 112 (85.5%) of the 
principals surveyed. The same questionnaire was sent to all 94 junior high 
school principals (grades 7-9) in that province, to which 84 principals (89.4%) 
responded. This expanded questionnaire retained Gunn’s provision for over- 
all satisfaction and effectiveness ratings but invited satisfaction ratings on 
41 job facets, school effectiveness ratings on 38 criteria, and principalship 
effectiveness ratings on 29 criteria. Table 1 summarizes the structure of data 
collection for the three studies. 

In each of the three studies, follow-up interviews were conducted with 10 
principals in order to explore outstanding findings and other important 
issues in greater depth than was possible with the questionnaire. Inter- 
viewees were chosen to include known personal characteristics of principals 
and organizational attributes of schools in approximate proportion to the 


Table 1 
Structure of Data Collection from Principals 


Elementary Junior High Senior High 


Questionnaires 


Population of schools/principals 654 94 155 
Questionnaires distributed 131 94 155 
Questionnaires returned 112 (85.5%) 84 (89.4%) 133 (85.8%) 
Job facets investigated 41 41 35 
School effectiveness criteria investigated 38 38 9 
Principal effectiveness criteria investigated 29 29 9 
Interviews 
No. conducted 10 10 10 
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provincial random sample or population. Although this strategy did not give 
the interview findings statistical generalizability, it did help to ensure that a 
suitable range of perspectives would be taken into account in this detailed 
investigation. To this end, each interview sample of 10 principals was chosen 
with school setting, type and size of school (as measured by number of 
teachers), as well as gender of the principals in mind. The elementary school 
study is indicative: Those four aspects were accommodated generally in 
accordance with their appearance in responses from the random sample (see 
Table 2). 

The elementary and junior high school studies also investigated the im- 
portance of job facets/criteria for overall satisfaction/effectiveness, and a 
number of commonalities and discrepancies between those levels were noted. 
Another important addition was the inclusion of a check on principals’ 
perceptual biases; equivalent questionnaire data about the effectiveness of 
schools and principals were obtained from teachers and area superintendents 
associated with the schools in one major city. Of the 34 elementary schools in 
the study which fell in that city region, responses were received from 30 
principals (88.2%), from 262 (66.7%) of the 391 teachers in 32 schools, and 
from nine area superintendents responsible for 33 (97.1%) of the schools. In 


Table 2 
Characteristics of Schools and Principals in Elementary School Interviews 
(N= 10) 
Characteristic Questionnaire Sample __ Interview Sample 
%f f %f 
School Setting 
City 66.1 6 60.0 
Town 20.5 2 20.0 
Rural 13.4 2 20.0 
Type of School System 
Public District 50.0 4 40.0 
Separate District 20.5 2 20.0 
County 17.0 2 20.0 
Division 12.5 2 20.0 


gt hi 


Number of Full-Time Equivalent Teachers 


Fewer than 10 27.5 3 30.0 

10-19 51.4 4 40.0 

20-29 19.3 3 30.0 

30 or more 1.8 - - 
Se a a Bae ni A ee ee eee ee 
Sex of Principal 

Female 19.6 2 20.0 

Male 80.4 8 80.0 
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the junior high school study, responses were received from 21 (95.5%) of the 
22 principals, 132 (52%) of 254 teachers in 21 schools, and nine area superin- 
tendents representing all 22 schools. The substantial support by educators 
and administrators approached in these studies is indicative of a widespread 
professional regard for the importance of the job satisfaction and effective- 
ness issues. 

Numerical data obtained from principals were analyzed using the same 
procedures for all three studies: Mean ratings were determined for global 
variables as well as individual facets/criteria. Means were also calculated for 
ratings of importance of each facet/criterion. Spearman and Kendall rank- 
order correlations were performed to compare the rank ordering of items for 
perceived satisfaction/effectiveness and importance. (For this analysis, the 
criterion “displaying leadership by the principal” was compared with princi- 
pals’ mean ratings on the item “your overall effectiveness as a leader.”) Data 
obtained from teachers and area superintendents were initially partitioned 
by school so that school means could be calculated. Resultant figures per- 
mitted comparable means to be determined for the various constituencies 
(i.e., without influence from differences in school size). 

With regard to reliability of questionnaires, and therefore of the com- 
parison, split-half correlations using an odd-even split of items indicated high 
measures of reliability in all three studies. For example, correlation coeffi- 
cients for the job satisfaction, school effectiveness, and principalship effec- 
tiveness sections of the survey of elementary school principals were .96, .95, 
and .95. Similar figures were obtained for the other educational levels and 
other constituencies. Validity was enhanced by questionnaire development 
based on previously validated research instruments (e.g., Holdaway, 1978; 
Hoy & Ferguson, 1985; Ratsoy & Miklos, 1985; Rice, 1978); systematic and 
thorough pilot testing; and triangulation through the use of multiple con- 
stituencies, multiple data collection methods, and multiple researchers. 

Before documenting results and comparisons from the three studies, a 
note of caution is in order. Clearly the extent and validity of the comparisons 
presented in this article are limited by possible effects associated with chang- 
ing attitudes and circumstances since the time of Gunn’s 1984 survey and 
interviews, and by the more limited investigation of effectiveness in that 
study. In addition, Gunn’s data collection was confined to responses by 
principals, so a possible halo effect in those findings could not be assessed. 


Results 
Job Satisfaction of Principals 


Mean responses of principals from sampled elementary schools and popula- 
tions of junior high and senior high schools across Alberta are displayed in 
Table 3. Means for overall job satisfaction were similar for the three levels 
(elementary 5.1, junior high 5.2, senior high 5.2) with all three groups ex- 
pressing moderate satisfaction with their work; 32%, 36%, and 39% of the 
respondents regarded themselves as highly satisfied. With regard to specific 
facets, responses at all levels exceeded the six-point scale mean of 3.5 with 
the lowest mean recorded by any group on any item being 3.8. 
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Comparison of responses on specific facets revealed certain common 
perceptions as well as some marked school-level discrepancies. The three 


Table 3 
Principals’ Levels of Job Satisfaction 


Mean Levels of Satisfaction 


Job Facet Elementary Junior High Senior High 
(N= 112) (N = 84) (N = 133) 


Working Conditions 
1. The way consultation between board and teachers 
about working conditions is conducted in the 


school system 4.3 43 3.8 
2. The salary the principal receives 4.4 4.1 4.7 
3. Fringe benefits under the contract 4.6 4.3 4.7 
4. Quality of custodial services in the school 46 4.7 48 
5. The number of hours the principal is required to 
work 4.2 4.5 4.4 
6. The principal’s physical working conditions 4.6 4.9 4.7 
7. Availability of clerical staff to assist the principal 4.7 4.7 4.9 
Personnel-Related Matters 
8. The principal’s working relationships with teachers 5.4 aie) 5.4 
9. The principal’s social relationships with teachers 50) 5.0 52 
10. The teaching competence of teachers Bie 5.0 Dal 
11. The competence of teachers in handling 
professional duties outside their classrooms 5.0 4.8 4.7 
12. Attitudes of teachers toward ongoing professional 
improvement 4.9 4.6 - 
13. The help the principal gives teachers and students 
to succeed 5.0 4.9 - 
14. Attitudes of teachers toward change 4.6 45 4.5 
15. Satisfaction and morale of the staff 
(SH: Morale of staff) 5:2 5.1 5.0 
Student-Related Matters 
16. The principal’s relationships with students 5.4 5.4 5.4 
17. The attitudes of students toward education 4.6 4.6 4.4 
18. Satisfaction and morale of students 
(SH: Student “spirit”) 5.0 5.0 4.7 
19. Achievement of students 4.8 4.9 - 
Role-Related Matters 
20. The principal's freedom to change the school 
program 4.6 4.6 4.8 
21. The principal's freedom to allocate teaching 
assignments 5.0 5.4 5.4 
22. The principal's involvement in hiring teachers for 
the school 4.5 4.6 4.8 
23. Authority associated with the principal's position 4.9 Sa Bel 
24. The principal's involvement in budget preparation 4.7 4.9 4,7 
25. The principal's responsibility for formal teacher 
evaluation 5.1 5.2 ; 
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Table 3 (continued) 
Principals’ Levels of Job Satisfaction 


Mean Levels of Satisfaction 


Job Facet Elementary Junior High Senior High 
(N = 112) (N = 84) (N = 133) 


District-Related Matters 


26. The principal’s relationship with the superintendent 5.0 5.1 4.9 
27. The principal's relationships with other central 

office staff é 4.9 5.2 4.9 
28. The principal’s involvement in decision making at 

the district level 4.3 4.3 4.2 
29. Availability of useful advice to assist the principal 

with problems 4.7 4.8 4.5 
30. Opportunities for useful inservice education for the 

principal 4.7 4.9 4.2 
31. Expectations of the school board for the principal 4.4 4.6 4.6 
32. The methods used to evaluate principals 4.3 4.6 4.2 
33. Attitudes of school board administrators toward 

school teachers and administrators 4.7 4.9 4.1 


Occupation-Related Matters 


34. Attitudes of parents toward the school : Sad S21 4.9 
35. The principal’s social position in the community 5.0 5.0 Sil 
36. Achievement of the principal’s own professional 

objectives 5.0 ae . 
37. The principal’s sense of accomplishment as an 

administrator 5.1 512 5.0 
38. Recognition by others of the principal’s work 4.6 4.8 4.8 
39. The effect of the job on the principal’s personal life 4.3 4.5 4.5 
40. Opportunities for advancement as an administrator 4.5 4.5 4.6 
41. Successful completion of projects and tasks 5.1 5.0 : 
42. Your overall feeling of satisfaction with your job 5.1 3:2 o.2 
Notes: 


1. Means are based on ratings on a six-point scale ranging from 1, highly dissatisfied, to 6, 
highly satisfied. 

2. Items are taken from the elementary school instrument. Parentheses indicate differences in 
wording for equivalent items in junior high (JH) and senior high (SH) school instruments. 


groups all exhibited particular satisfaction with their working relationships 
with teachers (5.4, 5.5, 5.4), their relationships with students (5.4, 5.4, 5.4), 
and rather less consistently their freedom to allocate teaching assignments 
(5.0, 5.4, 5.4). All were much less satisfied with the various aspects of their 
working conditions (especially with board-staff consultation over working 
conditions, 4.3, 4.3, 3.8), with most district-related matters (particularly 
their involvement in central office decision making, 4.3, 4.3, 4.2), and with 
students’ attitudes about education (4.6, 4.6, 4.4). 

Junior high principals were highest on 15 job facets, senior high principals 
on eight facets, and elementary principals on six facets; other dimensions had 
tied mean ratings. Elementary principals tended to be the most satisfied 
group in respect of matters relating to staff and students, but least satisfied 
in role- and occupation-related matters. More specifically, their mean satis- 
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faction level was noticeably higher than that of junior high school principals 
in connection with teachers’ attitudes about ongoing professional growth 
(4.9, 4.6, n.a.). On the other hand, elementary school principals were consid- 
erably less satisfied with their freedom to allocate teaching assignments (5.0, 
5.4, 5.4), their physical working conditions (4.6, 4.9, 4.7), their involvement 
in hiring teachers (4.5, 4.6, 4.8), their relationships with central office staff 
other than the superintendent (4.9, 5.2, 4.9), and the number of hours they 
were required to work (4.2, 4.5, 4.4). 

Principals of junior high schools were more satisfied with matters pertain- 
ing to the district and their administrative role in the school, as well as on 
other occupation-related facets. They were less satisfied in many personnel 
matters. In specific terms, junior high principals were substantially more 
satisfied with central office attitudes toward school staff members (4.7, 4.9, 
4.1); their freedom to allocate teaching assignments (5.0, 5.4, 5.4); their 
workloads (4.2, 4.5, 4.4); their relationships with central office personnel 
(4.9, 5.2, 4.9); and their access to professional advice (4.7, 4.8, 4.5). They 
were also more satisfied with the methods used for evaluating principals at 
that level (4.3, 4.6, 4.2). Salaries (4.4, 4.1, 4.7) and fringe benefits (4.6, 4.3, 
4.7) and teachers’ attitudes about ongoing professional improvement (4.9, 
4.6, n.a.), however, provided less job satisfaction for the principals of junior 
high schools. 

With the exception of board-staff negotiations about conditions (4.3, 4.3, 
3.8), satisfaction ratings for various aspects of the principal’s working condi- 
tions tended to be highest among senior high principals. However, that group 
also had the lowest satisfaction levels on matters involving the school dis- 
trict. With regard to individual facets, differences in favor of senior high 
school principals included their salary (4.4, 4.1, 4.7) and involvement in 
hiring teachers (4.5, 4.6, 4.8). On the other hand, they were least satisfied 
with many other job facets, most particularly with school board attitudes 
toward teachers and administrators (4.7, 4.9, 4.1); board consultation with 
teachers over working conditions (4.3, 4.3, 3.8); and inservice provisions for 
principals (4.7, 4.9, 4.2). Other relatively less rewarding facets for the senior 
high school principals were teachers’ competence outside their classrooms 
(5.0, 4.8, 4.7); the satisfaction and morale of students (5.2, 5.1, 5.0); rela- 
tionships between the principal and central office staff (4.9, 5.2, 4.9); the 
availability of advice for the principal (4.7, 4.8, 4.5); and the evaluation 
procedures used to assess senior high school principals (4.3, 4.6, 4.2). 

The follow-up interviews also revealed some differing dimensions and 
levels of satisfaction, particularly at the elementary and junior high levels, 
for three respondents had had principalship experience in both types of 
schools. One concentrated on the satisfaction derived from interaction with 
students at the different stages. His current satisfaction as an elementary 
school principal had arisen merely from “closeness to children in this impor- 
tant growing and molding stage” of their lives and education. In contrast, his 
job satisfaction as a junior high school principal had been “more subject- 
oriented.” He also elaborated on the satisfaction of witnessing the rapid 
growth of the older pupils: 


275 


N.A. Johnson and E.A, Holdaway 


[At the start of the year we’d say] “we’ve got these wild and woolly junior high 
types. Let’s see how much we can accomplish in three years while we sit on 
them in one sense and stretch them in the other”.... [From the students I 
found] the satisfaction of seeing the tremendous growth ... over the three- 
year time frame with regard to children coming in—not being able to do 
anything but make squawks on the musical instrument, [having to] be sure 
they get their gym shorts on one leg at a time, the very feeble fumbling with 
the science equipment (“Don’t burn your eyebrows when you light the bunsen 
burner”)—to some very sophisticated things three years later. And the social, 
the maturational aspect would give a great deal of satisfaction. And being able 
to master the subject matter as well as keeping the lid on what some people 
call “the greatest disaster in education”—the development of the junior high 
student. 


A second principal cited behavioral problems as a particularly dissatisfy- 
ing feature of the junior high school—a feeling that was displaced at the 
elementary level by 


being able to accomplish something ... a person got to spend so much more 
time in the classrooms. Then it was a really satisfying experience to find out 
all the good things that were going on ... where their [teachers’] strengths 
were ... getting into more educational things—planning and so on—rather 
than just discipline [as before]. 


This perception accords with Leithwood’s (1986) identification of onerous 
disciplinary responsibilities for junior high school principals. On the other 


hand, a third principal in the study found other time pressures greater, and 
more dissatisfying, at the elementary level: 


Time is the biggest problem—the fact that you have these ideas, and staff say, 
“Wouldn’t it be nice to do this,” and it is within the capability of the school, of 
the staff, except that there is no time to do them properly—and I do not get 
any satisfaction out of doing a job poorly. So, rather than do a job poorly, I will 
frequently not do it at all. And I think that’s the one thing that’s much more 
so in elementary schools than in any other schools.... The time pressure, the 
lower [school level] you go, becomes greater and greater. The fact that I teach 
70% of the time gives you some idea of the time I have to spend on administra- 
tive tasks.... and there really isn’t a lot of time to do long-range planning. 


In order to identify the most important contributors to principals’ overall 
job satisfaction, the elementary and junior high school principals were also 
asked to rate the importance of each facet for overall job satisfaction, using a 
scale ranging from 1, Not Important, to 4, Extremely Important. Principals 
at both levels considered staff-related matters to be vital for their job satis- 
faction: Variables concerning principals’ working relationships with teach- 
ers, the teaching competence of teachers, and staff satisfaction and morale 
had means of 3.9 or 4.0, and percentages answering extremely important 
ranged from 93% to 99%. They also rated parental attitudes (3.9, 3.8); 
successful completion of projects and tasks (3.8, 3.6); the sense of accom- 
plishment derived from administration (3.9, 3.9); and the availability of 
clerical staff (3.8, 3.8) as particularly important elements of their overall 
satisfaction as principals. Conversely, respondents at both school levels con- 
sidered the principal’s social relationships with teachers (2.8, 2.9); social 
position in the community (2.6, 2.9); opportunities for advancement (3.1, 
3.2); and involvement in central office decision making (3.2, 3.1) to be least 
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important. Comparison of mean importance ratings by the two groups of 
respondents revealed no substantial discrepancies in perceptions, and at each 
level the most important factors were also rated highly for satisfaction. 
However, the principals in these studies tended to rate many of the items as 
at least moderately important for their overall job satisfaction. 


Principals’ Perceptions of School Effectiveness 


In global terms, principals in elementary, junior high, and senior high schools 
all rated their schools moderately highly on the effectiveness scale (5.2 on the 
six-point scale); 22%, 29%, and 27% of principals rated their schools as highly 
effective, and schools in general were judged to be somewhat more than 
moderately effective overall. School-level comparisons based on individual 
criteria were hampered by the greatly expanded array of criteria used in the 
1986-87 instrument for elementary and junior high schools. Nevertheless, 
analysis revealed a broad pattern of reasonably consistent mean ratings 


Table 4 
Principals’ Perceptions of School Effectiveness 


Means of Effectiveness Ratings 


Effectiveness Criterion Elementary Junior High Senior High 
(N= 112) (N =84) (N = 133) 


1. Setting school goals 5.0 5 - 
2. Communicating school goais to students 4.7 4.8 4.7 
3. Maintaining high expectations of students 5i 5:2 - 
4. Preparing students for junior high school 
(JH: Senior high) 4.9 Biz 4.5(a) 
(SH: [a] employment; [b] success in 
postsecondary institutions) 5.2(b) 
5. Emphasizing academic subjects 5:2 5.3 - 
6. Encouraging academic success Se Se - 
7. Frequently evaluating students according to 
defined standards 48 5.0 5.0 


8. Maximizing achievement in reading and writing 
(SH: Providing students with satisfactory skills in 


language) Sie 48 49 
9. Maximizing achievement in mathematics orl 4.9 4.9 

10. Maximizing development of social skills 5.0 5.0 - 
11. Maximizing development of creativity 4.7 4.6 - 
12. Providing worthwhile extracurricular activities 4.9 5.3 met 
13. Maximizing student satisfaction and morale Sail 5.2 - 
14. Experiencing minimal deviant behavior among 

students shy 5.0 - 
15. Enforcing behavioral rules among students 5.1 5.3 - 
16. Maintaining an appropriate school climate 5.4 5.4 - 
17. Acknowledging the achievements of staff and 

students 5.3 Sal - 
18. Maximizing the job satisfaction of individual staff 

members 4.9 4.9 - 
19. Maximizing the morale of the staff as a group Sr 5.0 - 
20. Maintaining low turnover of staff ee eh - 
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Table 4 (continued) 
Principals’ Perceptions of School Effectiveness 


Means of Effectiveness Ratings 


Effectiveness Criterion Elementary Junior High Senior High 
(N= 112) (N= 84) (N = 133) 


21. Maintaining high expectations of staff ones te ppd - 
22. Displaying staff commitment to the school 5.0 oF - 
23. Showing a high level of staff motivation 5.0 5.0 - 
24. Displaying staff cooperation and cohesion 5.2 5.0 - 
25. Discussing, planning, and preparing lessons and 

instructional materials collaboratively 4.7 4.7 - 
26. Constructively evaluating the work of teachers 4.9 5.0 - 
27. Keeping up to date with new methods and 4.9 4.8 - 

technology 
28. Demonstrating thorough knowledge of subject 

matter 5.0 5.0 - 
29. Using appropriate teaching methods 5.1 5.0 - 
30. Making efficient use of staff time or 5.0 - 
31. Making efficient use of resources 5.0 4.9 - 


32. Coping with emergencies and overloads of work 
(SH: Handling unexpected overloads of work or 


emergencies) 5.0 4.9 5.3 
33. Taking advantage of staffing changes 5.0 5:1 
34. Responding to changing community expectations 0 5.0 - 


35. Maintaining communication with the community 
(SH: Formal communication between teachers and 


parents) 5.3 5.2 4.5 
36. Obtaining support from the community 53 5.2 - 
37. Maximizing the satisfaction of parents 5.2 5.0 - 
38. Maximizing the satisfaction of the nonparent 

community 4.3 4.4 - 
39. The overall effectiveness of your school Sr Sic me 


Note: Effectiveness means are based on ratings on a six-point scale ranging from 1, highly inef- 
fective, to 6, highly effective. 


across the studies; elementary and junior high schools were highest and 
lowest in effectiveness on approximately equal numbers of criteria. Based on 
comparisons in only eight domains of the school’s work, senior high schools 
were at the top of the effectiveness comparisons on one item and at the 
bottom on four. Table 4 displays these findings. 

Principals at both elementary and junior high levels judged their schools 
most favorably in maintaining a suitable organizational climate (5.4, 5.4); 
obtaining support from the community (5.3, 5.2); communicating with the 
community (a perception not shared in senior high schools, 5.3, 5.2, 4.5); and 
emphasizing academic instruction (5.2, 5.3) and academic attainment (5.2, 
5.2). At the other extreme, both elementary and junior high school principals 
assigned relatively low effectiveness ratings to satisfying the nonparent 
community (4.3, 4.4), and their ratings for developing students’ creativity 
(4.7, 4.6) also tended to be lower than on other criteria. Even so, elementary, 


278 


Principals’ Effectiveness and Job Satisfaction 


junior high, and senior high school principals’ effectiveness ratings on these 
and all other items still exceeded the slightly effective level; in absolute 
terms, these perceptions reflected a view that schools across the educational 
spectrum were reasonably effective in all dimensions examined. 

Elementary schools emerged as more effective than junior and senior high 
schools in maximizing attainment in reading and writing (5.2, 4.8, 4.9). 
Elementary and junior high schools were substantially more effective in 
maintaining communication with the community than were senior high 
schools (5.3, 5.2, 4.5); nevertheless, the slight difference in wording of items 
about community liaison may have had some bearing on this finding. Junior 
high school principals’ responses were highest in the areas of extracurricular 
activities (4.9, 5.3, 5.1) and preparing students for subsequent commitments 
(4.9, 5.2, 4.5/5.2). In the latter regard, senior high school principals re- 
sponded separately about preparation for employment and postsecondary 
education; schools at that level seemed to equip students well for higher 
education but performed less effectively in preparing school leavers for the 
world of work. On the other hand, the senior high level was associated with 
greater flexibility in coping with work overloads and emergencies (5.0, 4.9, 
5.3). 

At the elementary and junior high levels, principals also rated the impor- 
tance of 39 criteria (including effective leadership by the principal) for judg- 
ing the overall effectiveness of schools. As with job satisfaction, a four-point 
importance scale was used. Principals in the senior high school study were 
only invited to advance three criteria that they perceived to be most impor- 
tant for this purpose, and support for each criterion was quantified in per- 
centage frequencies of mention. Within the constraints of those different 
data collection strategies, there emerged some general consensus as well as 
isolated differences of opinion about the most important school effectiveness 
criteria. Senior high school principals most frequently (65% of respondents) 
proposed staff and student satisfaction and morale as central issues, and this 
finding was supported in high ratings of equivalent criteria by respondents at 
the elementary and junior high levels (maximizing student satisfaction and 
morale, 3.9, 3.9; maximizing staff job satisfaction, 3.9, 4.0). Many of the 
senior high school principals (54%) also commended preparation for further 
education as a priority, although principals at the two lower educational 
levels were less adamant about the importance of this concern (3.4, 3.5). 
(Comparability on this criterion was limited by the dual focus of the finding 
from senior high school responses.) 

On the other hand, senior high school principals made no mention of 
many other criteria that principals in elementary and junior high schools 
rated as important, particularly the high ratings both groups assigned to 
maintaining an appropriate school climate, acknowledging achievements of 
staff and students, displaying leadership by the principal, using appropriate 
teaching methods, and maintaining high expectations of students and staff 
(all means 3.9 at both levels). Indeed elementary and junior high school 
principals rated each of the 39 criteria as important to some degree; hence 
criteria other than the four recorded in the senior high school findings may 
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also be of consequence for judging effectiveness at that school level. At the 
same time, emphases were evident in the elementary and junior high school 
results: Respondents had much less regard for the importance of nonparent 
community satisfaction (3.0, 3.0); low turnover of staff (3.2, 3.2); collabora- 
tive work by staff (3.3, 3.4); regular evaluation of students (3.2, 3.5); and 
academic emphasis (3.5, 3.3) than for the other, more highly rated criteria. 

Comparison of school effectiveness means with importance means at each 
level was also revealing. For elementary schools, a Spearman rank-ordered 
correlation coefficient of .62 (p<.01) indicated a high degree of association of 
effectiveness and importance rankings. This was tempered, however, by the 
results of Kendall correlation analysis (r=.46; p<.01), a more meaningful 
procedure in this instance, where the data included several tied ranks. With 
regard to performance on individual criteria, the school climate variable 
emerged as the most effective dimension and the most important criterion on 
which to base school effectiveness appraisals. Four other variables also ex- 
hibited broadly consistent effectiveness and importance rankings of means: 
acknowledging the achievements of staff and students (effectiveness rank 4 
of 39 items; importance rank 8); displaying staff cooperation and cohesion (6; 
10.5); maximizing achievement in reading and writing (8; 3); and maintain- 
ing high expectations of staff (9; 10.5). On the other hand, elementary 
schools were judged to be relatively ineffective in some important areas: 
maintaining high expectations of students (16.5; 2); displaying leadership by 
the principal (14; 4); maximizing student satisfaction and morale (13; 5); 
maximizing staff morale (20; 6); using appropriate teaching methods (18; 7); 
and maximizing achievement in mathematics (15; 9). 

At the junior high level, the conjunction of school effectiveness and impor- 
tance rankings was much lower (Spearman r=.28, p<.01; Kendall r=.21, 
p<.01). As with elementary schools, school climate was rated as highly 
effective as well as important (1; 2.5) as was maintaining high expectations 
of students (8.5; 4). However, several other criteria were rated as important 
but relatively ineffective, most particularly maximizing staff morale (18.5; 
1), acknowledging achievements of staff and students (14; 2.5), maximizing 
the job satisfaction of staff (28; 7), and using appropriate teaching methods 
(20.5; 10). 

The importance rankings in these studies resembled only partially the 
“recipes” commonly appearing in some school effectiveness literature. Paral- 
lel factors were school climate (elementary school importance rank 1; junior 
high 2.5); the principal’s leadership (4; 5); and high expectations of students 
(2; 4). However, other widely commended criteria were not highly ranked: 
evaluation of students (37; 30); academic emphasis (30; 37); and encouraging 
academic success (16.5; 18.5). Additional important factors also emerged as 
central to school effectiveness appraisal: staff morale (6; 1); student satisfac- 
tion and morale (5; 8); and, at the elementary level, achievement in reading 
and writing (3; 12.5). 
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Principals’ Perceptions of Their Effectiveness as Leaders 


As with their mean ratings of overall effectiveness of schools, principals of 
the elementary, junior high, and senior high schools across Alberta had 
common views about their own overall effectiveness. Means for all three 
groups were 5.1, slightly beyond the moderately effective level on the six- 
point scale used in each study, with 17%, 21%, and 21% rating themselves as 
highly effective school administrators. Further comparisons are limited by 
the restricted selection of criteria used in the senior high study; but, as the 
data in Table 5 indicate, some patterns could be discerned. 

As with the global assessments, there was some agreement about the 
range of effectiveness criteria used for self-appraisal. Across the school levels, 
assessments of effectiveness were highest for making timely, appropriate, 
and acceptable decisions (5.3, 5.3, 5.3) and providing an appropriate work 
environment for students and staff (5.3, 5.2, 5.4); and elementary and junior 
high school principals’ perceptions were also especially favorable regarding 
exercising exemplary behavior at school (5.4, 5.3), communicating with staff 
(5.3, 5.3), encouraging high expectations of students (5.1, 5.3), and allocating 
tasks appropriately among staff (5.1, 5.3). Principals, however, rated their 
effectiveness in enlisting the support of the nonparent community consider- 
ably lower than on other criteria (4.3, 4.3), although their common judgment 
was still in the range of effective performance. 

Substantial school-level differences in performance did occur on certain 
criteria. A major discrepancy related to senior high school principals’ com- 
paratively low mean rating of effectiveness in improving the performance of 
staff (4.9, 5.0, 4.5). Conversely, these principals were the most positive group 
in respect of responding to changing circumstances and expectations in the 
organizational environment (4.9, 4.8, 5.1). 

Principals in the elementary and junior high school studies also rated the 
importance of 29 criteria for assessing the overall effectiveness of principals. 
Most respondents regarded all criteria as important to some degree. Respon- 
dents at both levels assigned particularly high importance ratings to com- 
municating with staff (4.0, 3.9 on the four-point scale); encouraging high 
expectations of students (3.9, 3.9); and making timely, appropriate, and 
acceptable decisions (3.9, 3,9). The lowest importance ratings were for enlist- 
ing the support of the nonparent community (3.0, 3.0), but this still repre- 
sented a perception of moderate importance. Other relatively low mean 
ratings included communicating with groups in the community (3.5, 3.3) and 
responding to external expectations and changes (3.4, 3.4). 

For elementary schools, comparison of effectiveness and importance 
means indicated a high correlation between the two sets of variables: Spear- 
man r=.65 and Kendall r=.51 (both p<.01). On individual criteria, prin- 
cipals rated themselves as highly effective on the following highly important 
criteria: communicating with staff (effectiveness rank 3 of 29 items; impor- 
tance rank 1); making timely and appropriate decisions (2; 4); promoting 
high expectations of students (10; 3); fostering high staff and student morale 
(8; 5); and communicating with students (5.3; 9). Conversely, elementary 
school principals judged themselves to be relatively ineffective in some im- 
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Table 5 
Principals’ Perceptions of Their Effectiveness as Leaders 


Means of Effectiveness Ratings 


Effectiveness Criterion Elementary Junior High Senior High 
(N= 112) (N =84) (N = 133) 


—_ 


. Coordinating the development of school goals 5.1 Sl - 
2. Publicizing school goals 5.0 4.9 - 
3. Promoting the achievement of school goals 

(SH: Directing the efforts of teachers toward school 


goals) Su 5.0 - 
4. Exercising instructional leadership 5.0 5.0 - 
5. Making timely, appropriate, and acceptable 
decisions 5:3 5.3 5.3 
6. Improving the performance of staff 4.9 5.0 4.5 
7. Evaluating staff members 5.0 5.1 - 
8. Providing feedback to staff 51 Sa - 
9. Communicating with staff NS) 5) - 
10. Encouraging high expectations of students 5.1 5.3 - 
11. Providing an appropriate work environment for 
students and staff 5.3 De 5.4 
12. Promoting high expectations among staff members ey De 
13. Fostering high morale among staff and students 
(SH: Improving the morale of teaching staff [only]) 5.2 Sat 5.0 
14. Communicating with students 52 5.2 
15. Obtaining qualified staff 5.0 4.9 - 
16. Allocating tasks appropriately among staff 5.1 5.3 - 
17. Allocating resources 5.2 5.2 - 
18. Coordinating and integrating the activities of staff 
groups/departments 4.9 5:0 5.0 
19. Coping with uncertainty and conflict 0 5:2 et 
20. Coping with emergencies and overloads of work ei 5.0 - 
21, Fostering the professional growth of staff members 5.0 5.0 : 
22. Exercising exemplary behavior at school 5.4 5.3 - 
23. Increasing the job satisfaction of staff members 5.0 5.0 4.9 
24. Working with teachers to develop or change 
policies 5.0) 5.0 5.2 
25. Identifying community expectations 4.9 4.8 - 


26. Adapting policies and procedures to respond to 
external changes and expectations (SH: To 
accommodate change initiated by the external 


environment) 4.9 4.8 54 
27. Communicating with community groups 4.9 4.7 - 
28. Enlisting the support of parents 5.2 shu - 
29. Enlisting the support of the nonparent community 4.3 4.3 - 
30. Your overall effectiveness as a leader Bat 5.1 5a 


portant areas: obtaining qualified staff (17; 2); providing feedback to staff 
(13; 6); and improving the performance of staff (28; 7.5). 

The correspondence of effectiveness and importance rankings was not so 
close at the junior high level: Spearman r=.55 and Kendall r=.43 (both 
p<.01). As with elementary schools, communication with staff attracted 
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relatively high effectiveness and importance ratings (2; 1). Also highly rated 
for effectiveness and importance were the principal’s decision making (4; 3), 
encouragement of high expectations among students (5; 2) and staff (8.5; 
4.5), and communication with students (7; 10). Very important criteria that 
were rated as relatively ineffective related to improving the performance of 
staff (17; 4.5), obtaining qualified staff (20; 6), fostering high staff and 
student morale (13; 7.5), and coordinating the development of school goals 
(14; 9). 


Teachers’ and Area Superintendents’ Perceptions of School Effectiveness 


In the elementary and junior high school studies, parallel data about effec- 
tiveness of schools and principals were obtained from teachers and area 
superintendents associated with the schools in one large city. Comparisons of 
these response means revealed some clear patterns in the perceptions of 
educational constituencies. 

Teachers rated school effectiveness on the same criteria as did principals; 
the mean ratings are displayed in Table 6. Unlike the comparative assess- 
ments of principals, junior high teachers tended to judge effectiveness less 
positively than did the elementary school teachers. Of the 38 items on which 
responses were invited, lower ratings were assigned by teachers in the junior 
high schools in all but four instances. (Ratings on one further item, display- 
ing leadership by the principal, are dealt with in the next section.) Moreover, 
most of the differences were substantial: The greatest disparities related to 
effectiveness in maximizing staff morale (4.7, 4.1); maintaining low staff 
turnover (4.8, 4.3); and maximizing students’ attainment of important social 
skills (4.9, 4.3). Twenty-two other mean ratings by junior high school teach- 
ers were also at least 0.3 (on the six-point scale) below those of colleagues in 
elementary schools. This pattern of school-level differences was also ex- 
hibited in a slightly lower overall effectiveness rating by junior high school 
teachers (5.0, 4.9). Only in the matter of providing worthwhile extracur- 
ricular activities did junior high schools emerge as substantially superior (4.6, 
5.0); preparation for the succeeding school level was another dimension 
which favored those schools slightly (4.8, 4.9). 

One common feature in the responses was the relatively low ratings 
assigned at each school level to effectiveness in maximizing the satisfaction 
of the nonparent community (4.3, 4.0), preparing work collaboratively (4.3, 
4.0), maximizing staff satisfaction (4.6, 4.2) and morale (4.7, 4.1), and ex- 
hibiting minimal deviant behavior by students (4.7, 4.3). Even so, mean 
ratings on these items were at the slightly effective level or beyond. Criteria 
on which the two groups judged effectiveness highest also tended to coincide: 
Both elementary and junior high school teachers favored the performance of 
their schools in demonstrating thorough knowledge of subject matter (5.0, 
5.0); maintaining communication with the community (5.2, 4.9); and using 
appropriate teaching methods (5.1, 4.9). 

Equivalent data were gathered from area superintendents on a more 
limited array of school effectiveness items—those on which they could be 
expected to make judgments (Table 7). Area superintendents rated the 
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elementary and junior high schools as equally effective overall—both slightly 
less than moderately effective. According to these respondents, the dimen- 
sions of greatest and least effectiveness differed according to school level: 
Elementary schools were perceived to be particularly effective in setting 
school goals (5.0) and maintaining high expectations of staff (5.0), whereas 
junior high schools performed best in taking advantage of staff changes (5.3), 


Table 6 
Teachers’ Perceptions of School Effectiveness 


School Means of Effectiveness Ratings 


Effectiveness Criterion Elementary Junior High 
(N = 32) (N= 21) 
1. Setting school goals Sail 4.7 
2. Communicating school goals to students 4.8 4.6 
3. Maintaining high expectations of students 5.0 4.7 
4. Preparing students for junior high school (JH: 
Senior high) 4.8 4.9 
5. Emphasizing academic subjects 5,1 4.9 
6. Encouraging academic success Sel 5.0 
7. Frequently evaluating students according to 
defined standards ; 4.8 4.8 
8. Maximizing achievement in reading and writing 51 4.7 
9. Maximizing achievement in mathematics ey 4.9 
10. Maximizing development of social skills 4.9 4.4 
11. Maximizing development of creativity 4.5 4.1 
12. Providing worthwhile extracurricular activities 4.6 SO 
13. Maximizing student satisfaction and morale 5.0 4.7 
14. Experiencing minimal deviant behavior among 
students 4.7 4.3 
15. Enforcing behavioral rules among students 4.8 4.4 
16. Maintaining an appropriate school climate oN) 4.6 
17. Acknowledging the achievements of staff and 
students 4.9 46 
18. Maximizing the job satisfaction of individual staff 
members 4.6 4.2 
19. Maximizing the morale of the staff as a group 4.7 4.1 
20. Maintaining low turnover of staff 4.8 4.3 
21. Maintaining high expectations of staff 4.9 4.7 
22. Displaying staff commitment to the school 4.9 4.6 
23. Showing a high level of staff motivation 4.9 4.5 
24. Displaying staff cooperation and cohesion 4.8 4.5 
25. Discussing, planning, and preparing lessons and 
instructional materials collaboratively 4.3 4.0 
26. Constructively evaluating the work of teachers 4.6 4.4 
27. Keeping up to date with new methods and 
technology 4.9 4.6 
28. Demonstrating thorough knowledge of subject 
matter 5.0 5.0) 
29. Using appropriate teaching methods Dal 4.9 
30. Making efficient use of staff time 4.9 4.5 
31. Making efficient use of resources 4.8 4.6 
32. Coping with emergencies and overloads of work 47 4.3 
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Table 6 (continued) 
Teachers’ Perceptions of School Effectiveness 


School Means of Effectiveness Ratings 


Effectiveness Criterion Elementary Junior High 
(N = 32) (N= 21) 

33. Taking advantage of staffing changes 4.7 4.3 
34. Responding to changing community expectations 4.8 4.6 
35. Maintaining communication with the community Sia 4.9 
36. Obtaining support from the community 5.0 4.6 
37. Maximizing the satisfaction of parents ot 4.8 
38. Maximizing the satisfaction of the nonparent 

community 4.3 4.0 
39. Displaying leadership by the principal 5a 4.8 
40. The overall effectiveness of your school 5.0 4.9 

Table 7 


Area Superintendents’ Perceptions of School Effectiveness 


Means of School Effectiveness Ratings 


Effectiveness Criterion Elementary Junior High 
(N of schools = 33) (N of schools = 22) 


1. Setting school goals 5.0 4.8 
2. Communicating school goals to students 48 oO 
3. Maintaining high expectations of students 4.9 5.0 
4. Emphasizing academic subjects 4.8 5.0 
5. Maximizing academic achievement 4.6 4.8 
6. Providing worthwhile extracurricular activities 4.5 5.1 
7. Maximizing student satisfaction and morale 4.5 4.5 
8. Maintaining an appropriate school climate 4.7 4.5 
9. Acknowledging staff and student achievements 49 4.7 
10. Maximizing the job satisfaction of individual staff 
members 4.8 4.5 
11. Maintaining high expectations of staff 5.0 5.0 
12. Displaying staff cooperation and cohesion 4.8 4.6 
13. Keeping up to date with new methods and 
technology 4.6 4.6 
14. Demonstrating thorough knowledge of subject 
matter 4.9 4.9 
15. Using appropriate teaching methods. 4.6 4.6 
16. Taking advantage of staffing changes 4.6 Le | 
17. Responding to changing community expectations 4.7 4.7 
18. Obtaining support from the community 4.8 4.7 
19. Maximizing the satisfaction of parents 4.8 4.8 
20. Maximizing the satisfaction of the nonparent 
community 4.4 4.1 
21. Overall school effectiveness 4.7 4.7 


seen Ta 
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providing worthwhile extracurricular activities (5.1), communicating school 
goals to students (5.0), maintaining high expectations of students (5.0) and 
staff (5.0), and emphasizing academic subjects (5.0). Both types of schools 
were rated least effective in maximizing the satisfaction of the nonparent 
community (4.0, 4.0). 

There was no consistent school-level difference across the area super- 
intendents’ effectiveness ratings on individual criteria. Elementary schools 
were more effective in maximizing staff job satisfaction (4.8, 4.5) and satis- 
faction of the nonparent community (4.4, 4.1), whereas the differences 
strongly favored junior high schools for taking advantage of staffing changes 
(4.6, 5.3) and providing worthwhile extracurricular activities (4.6, 5.3). 

Two further observations also should be recorded. First, comparison of 
the means in Tables 3, 6, and 7 reveals that, at both levels, principals’ ratings 
of school effectiveness were typically higher than those of teachers and area 
superintendents, and area superintendents viewed both school levels with 
somewhat less approval than that exhibited by the other two constituencies. 
The second pervasive attribute revealed by such a comparison is one of 
difference between the school levels: While principals and area superinten- 
dents expressed perceptions that were either approximately equivalent or 
that at least balanced out across the range of criteria, elementary school 
teachers regarded their organizations as considerably more effective than did 
junior high school teachers. 

As with principals, elementary and junior high school teachers also rated 
the importance of 39 criteria for judging the school effectiveness. Responses 
by teachers at the two levels were similar for most criteria. Both groups 
regarded the following criteria as especially important: maintaining high 
expectations of students (3.8, 3.8 on the four-point scale); maintaining an 
appropriate school climate (3.8, 3.8); and displaying leadership by the prin- 
cipal (3.8, 3.8). Both groups also agreed about the least important criteria: 
satisfaction of the nonparent community (2.8, 2.8); low turnover of staff (3.0, 
3.0); collaborative work by teachers (3.1, 3.0); frequent evaluation of stu- 
dents (3.1, 3.4); provision of extracurricular activities (3.1, 3.4); responses to 
changing community expectations (3.3, 3.2); and the advantage taken from 
changes in staff (3.2, 3.2). In each instance, however, both groups further 
admitted all criteria as important elements for school effectiveness appraisal. 
On a small number of items, importance was viewed differently by teachers 
at the two school levels. Elementary school teachers placed greater emphasis 
on the development of important social skills (3.8, 3.5) and acquisition of 
community support (3.6, 3.3) and assigned lesser importance to evaluating 
students regularly (3.1, 3.4) and providing worthwhile extracurricular ac- 
tivities (3.1, 3.4). 

The nine area superintendents were invited to record the most important 
criteria for assessing organizational effectiveness in elementary, junior high, 
and senior high schools. Whereas seven considered that the same criteria 
apply to all school levels, the other two believed different criteria should be 
used. As one area superintendent noted, at the junior high school level 
greater emphasis should be placed on maintaining “firm and consistent 
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discipline and student expectations.” The second contended that criteria 
should be child-centered in elementary schools but content-oriented at higher 
levels. 


Teachers’ and Area Superintendents’ Perceptions of the 
Effectiveness of Principals 


One item in the questionnaires completed by teachers required a rating of the 
principal’s effectiveness. Teachers in elementary schools were more positive 
than were those at the junior high level (means 5.1, 4.8 on the six-point 
scale). Area superintendents, however, did not share teachers’ view of great- 
er leadership effectiveness in elementary schools, preferring to rate junior 
high principals as marginally more effective overall (4.7, 4.8). Further insight 
about area superintendents’ responses was obtained through ratings of 12 
leadership effectiveness criteria. Principals at the two levels were judged to 
be superior in different domains, although only in the matter of allocating 
tasks appropriately among staff (4.6, 4.9) was there evidence that one school 
level Gunior high) performed at a substantially higher level. The general 
pattern was one of similar effectiveness across the school levels; in par- 
ticular, both groups of principals were rated highly in allocating resources 
(5.0, 5.0); providing an appropriate work environment for students and staff 
(5.0, 4.8); providing feedback to staff (4.9, 4.8); and publicizing school goals 
(4.9, 4.8). Elementary and junior high school principals alike were regarded 
as least effective in enlisting the support of the nonparent community (4.3, 
4.2). 


Other School-Level Differences 


Although the foregoing findings and comparisons focused on job satisfaction 
and school effectiveness, unsolicited comments by two teachers in the ele- 
mentary school study also indicated perceptions of other critical differences 
among educational levels. One teacher’s questionnaire recorded differential 
educational opportunities in elementary and high schools as a result of 
influence in the school system office: “Central office staff ... backgrounds and 
interests lie in high school education, so elementary education becomes 
neglected—the poor cousin in the system.” Another statement adds weight 
to this assertion: 


Elementary schools and elementary teachers have always been treated as 
second-class citizens. Junior and Senior High schools can always find money 
to provide prep time for their teachers. Junior and Senior High schools 
always find a way to dismiss their kids a day or two or even much more before 
a major holiday or semester break yet elementary teachers are obliged to 
spend all day, every day, “right to the bitter end” with their students. 
Elementary teachers are required to fill out long and detailed report cards on 
every child while their colleagues in Junior and Senior High only fill in the 
appropriate circle on a computer sheet and the computer does the rest, even 
down to a sterile comment on the student’s progress. Elementary teachers 
conduct lengthy and formal teacher-parent conferences at report-card time 
while Junior and Senior High teachers engage most often in large poorly 
structured interviews with all the privacy that a school gymnasium can 
afford. Large numbers of elementary teachers, a few Junior High teachers 
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and no Senior High teachers spend large amounts of time and energy in 
improving their teacher effectiveness. 


While the researchers did not invite teachers’ perceptions of school-level 
differences in effectiveness or other matters, such an expression of frustra- 
tion and concern as this underscores the importance of this domain of 
inquiry. It also draws attention to the need to emphasize the perceptions of 
the teaching constituency in future research initiatives. 


Discussion 
Comparison of job satisfaction responses across the elementary, junior high, 
and senior high school levels revealed that, overall, the three categories of 
principals all experienced moderate satisfaction with their jobs. The school- 
level consistency of overall satisfaction (means were all within 0.1) contrasts 
with earlier findings (Holdaway, 1978; MacQueen & Ignatovich, 1986) about 
school-level discrepancies in the overall satisfaction of teachers. 

In the absence of more detailed investigation of principals’ job satisfac- 
tion, such consistency could cast doubt on the worth of comparative studies. 
However, while overall satisfaction at all levels was moderate and similar, 
and although all levels in the three studies were also associated with at least 
slight satisfaction in each job dimension investigated, the occurrence of some 
substantial discrepancies highlights both the relevance of detailed compara- 
tive study of organizational types and a clear basis for planning and defend- 
ing remedial action. 

Across the school levels, but particularly in senior high schools, relatively 
low satisfaction ratings were assigned to principals’ satisfaction with their 
role in district decision making and in district negotiations with teachers 
over working conditions—factors that, like most others, principals regarded 
as at least moderately important for their overall job satisfaction. The find- 
ings about workloads, especially those of elementary school principals, were 
surprising and may be of concern. The comparison of workloads across the 
three levels contrasts with earlier observations in the United States (Kmetz 
& Willower, 1982; Martin & Willower, 1981) that elementary principals work 
fewer hours and at a less hectic pace. Clearly the United States perception 
was not shared by principals in Alberta. For junior high principals in par- 
ticular, salary and benefits warrant attention. Some compensation for the 
difficult task of managing the behavior of adolescent children (according to 
one interviewee quoted earlier, “the greatest disaster in education”; see also 
Leithwood, 1986) may be worth considering. The most marked difference 
among the three school levels related to senior high principals’ relatively low 
satisfaction with personal opportunities for administrative inservice. A de- 
cade earlier, Holdaway (1978) found a similar difference between elementary 
and senior high school teachers’ levels of satisfaction with inservice educa- 
tion. Apparently, senior high personnel in general feel less satisfied about 
professional development opportunities; some effort to isolate and cater for 
their perceived inservice needs may be worthwhile. Principals’ relationships 
with superordinates at all levels also seem to deserve closer research atten- 
tion, to assess the quality and extent of district-school interaction. From an 
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administrative point of view, district administrators may do well to consider 
ways of rectifying elementary principals’ perceived overcommitment, giving 
junior high school principals more cause to be satisfied with extrinsic re- 
wards, and generally improving communication with the school executives 
under their jurisdiction. Perhaps something can be learned from the satisfy- 
ing associations principals already seem to have within their schools. 

_ According to elementary and junior high school principals, a variety of 
facets are important components of job satisfaction for both classes of prin- 
cipals. Indeed these groups tended to have similar priorities in connection 
with the relative importance of the facets specified. What did emerge from 
the importance ratings is a clear indication of the complexity of job satisfac- 
tion research. Overall job satisfaction of school administrators seems to 
depend on many facets of the job, and researchers need to direct future 
attention accordingly. One matter not explored in these studies but which 
could broaden the scope of research still further is the interconnection be- 
tween work variables and individuals’ personal lives as influences on job 
attitudes (see Blase & Pajak, 1986; Fraser, 1983). Future studies in this area 
may need to be approached in more expansive fashion. 

The generally high ratings of overall effectiveness of schools across the 
three groups of principals should encourage practitioners. In addition, all 
types of schools were judged to be effective to some extent even on the 
criteria that were associated with the lowest levels of performance—although 
the small number of effectiveness criteria in Gunn’s (1984) study makes such 
assertions about senior high schools difficult. Notwithstanding the limitation 
on detailed performance assessment and comparison, performance was also 
regarded as moderately effective (5.0) or better on four of the standard school 
effectiveness criteria—high expectations, leadership, climate, and academic 
focus (see Edmonds, 1979; Lezotte, 1986), but not on evaluation of elemen- 
tary school students (4.8). Insofar as these dimensions are accepted as cen- 
tral to effective performance, principals’ ratings indicate general success in 
Alberta schools. Senior high schools were perceived to be considerably more 
effective than other schools in coping with emergencies and overloads of 
work. Junior high schools had the best effectiveness ratings in extracur- 
ricular activities and preparation of students for subsequent pursuits. The 
elementary school principals responded most favorably about maintaining 
communication with the community and maximizing reading and writing 
achievement. 

The finding that elementary schools enjoy a closer relationship with their 
communities supports Leithwood’s (1986) assertion. However, the relatively 
poor rating of senior high schools is of concern, as are the elementary and 
junior high ratings about their nonparent communities. These findings high- 
light a need to examine the school-community issue more closely in future 
research. In view of the current worldwide trend toward school-based man- 
agement and community involvement in educational decision making (see, 
e.g., Caldwell & Spinks, 1988; Coopers & Lybrand, 1988; New South Wales, 
1989), research in this vein may be a matter of urgency. 
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The superior effectiveness of elementary schools in reading and writing 
may be attributable merely to a priority at this level on development of 
fundamental skills, compared with more diverse emphases and specializa- 
tions (e.g., behavior management, extracurricular activities, career prepara- 
tion) that are commonly expected of junior and senior high schools. Equally 
clear, however, is the support from school effectiveness literature (e.g., Ed- 
monds, 1979; Purkey & Smith, 1982) for an academic focus in the school, 
which necessarily includes reading and writing. 

The findings about important criteria only partially supported the school 
effectiveness checklists often appearing in the literature (factors supported 
were school climate/atmosphere, leadership by the principal, and high expec- 
tations of students). However, additional criteria emerged as important 
(staff morale, student satisfaction and morale, and achievement in reading 
and writing); and other widely commended criteria were not highly ranked 
(evaluation of students, academic emphasis, and encouraging academic suc- 
cess). Moreover, the elementary and junior high studies revealed that all 39 
dimensions investigated are of some consequence at those levels, tending to 
support criticisms (e.g., Good & Brophy, 1986; Mackenzie, 1983) of literature 
(e.g., Edmonds, 1979; Lezotte, 1986) that represents a small collection of 
attributes as sufficient for judging overall success. Therefore, for future 
diagnosis and research on school effectiveness, the more complex and com- 
prehensive organizational effectiveness models should be preferred, and the 
findings from theoretically grounded and more wide-ranging school effec- 
tiveness investigations (e.g., Hoy & Ferguson, 1985; Ratsoy & Miklos, 1985) 
should be regarded as more dependable and informative. 

Implications also accrue from the importance-effectiveness comparisons. 
While elementary and junior high schools both emerged as highly effective on 
the important matter of maintaining a suitable school climate, performance 
at one or both levels was judged less favorably on several other important 
criteria. The following aspects of elementary schools seem to warrant close 
inspection by school and district administrators: expectations of students, 
leadership by principals, student satisfaction and morale, staff morale, teach- 
ing methods, and mathematics achievement. At the junior high level, atten- 
tion should focus on staff-related matters: staff morale and satisfaction, 
acknowledgment of achievements by staff and students, and teaching meth- 
ods. At the same time, the high absolute mean rating of effectiveness ac- 
corded to each of these items places these apparent problem areas in 
perspective and thereby may afford practitioners some consolation. 

With regard to leadership effectiveness, principals across the three levels 
offered similarly positive appraisals of their own performance. Common to 
all levels were the especially high ratings of effectiveness in making timely, 
appropriate, and acceptable decisions and in providing an appropriate work 
environment for students and staff. Some criteria represented points of 
differing opinion. Senior high school principals saw themselves as substan- 
tially less effective than other principals in improving the performance of 
staff. This finding is to be expected: As noted earlier, principals at this level 
are known to be more distanced from classroom preparation and instruction 
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(Kmetz & Willower, 1982; Martin & Willower, 1981), mainly as a result of 
large school size, staff and subject specialization (Hallinger & Murphy, 1987; 
Jones, 1988), and consequent loose coupling of organization structure and 
control (Herriott & Firestone, 1984). In accordance with Jones’ (1988) obser- 
vation, attention needs to be given to ways that senior high principals can 
influence effectiveness through the departmental structure; research, dis- 
trict policy, and principals’ own actions are at issue here. 

In contrast to the foregoing finding, senior high school principals emerged 
on self-assessments as the most effective administrators in responding to 
changing environmental circumstances and expectations. This finding is 
especially interesting in view of the relatively poor performance ratings 
senior high principals attributed to the effectiveness of their schools in 
communicating with the community. These principals may have seen them- 
selves as having to be pivotal links between an unresponsive staff and 
community feeling—a view that conflicts with Kmetz and Willower’s (1982) 
comparison but supports the more general literature (e.g., Payne, 1988) 
about the role of principals in balancing community, school, and other com- 
peting demands. Unfortunately, Gunn’s (1984) senior high school study 
obtained responses about principals’ perceived responsiveness to externally 
initiated change but not specifically about their ongoing communication with 
the community. Accordingly, speculation about reasons for the relative adap- 
tability of senior high principals and poorer school-community interaction at 
that level needs to be supported by future research. When matched with 
principals’ relatively low self-assessments about community liaison, as well 
as the current climate of devolution of administrative control to schools, it 
seems vital to study and compare the boundary-spanning activities and 
effectiveness of principals at the three school levels. From a practical 
standpoint, higher education and inservice programs may need to be revised 
to assist principals with community relations. 

Elementary and junior high school principals agreed on the importance of 
a wide range of criteria for judging the effectiveness of principals in those 
schools. This finding supports the conclusion reached above about school 
effectiveness criteria and appraisals: Assessments of administrative perfor- 
mance and research initiatives confined to the conventional effectiveness 
formulae which currently dominate effective schools programs and literature 
should be viewed as inadequate and, therefore, should be discouraged. 

Equally interesting, however, is the finding that principals in these stud- 
ies regarded effective leadership as relating most strongly to intraorganiza- 
tional performance. Not only was effectiveness rated lowest on community- 
related criteria, but principals also viewed those criteria as least important to 
their effectiveness as leaders. As noted above, district administrators and 
educational policy makers may wish to see the perspective of school ad- 
ministrators broadened to include a stronger focus on community involve- 
ment and opinion. Lacs 

Elementary school principals regarded themselves as least effective in 
some areas of particular importance: obtaining qualified staff, providing 
feedback to staff, and improving the performance of staff. Highly important 
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criteria on which junior high principals rated themselves as relatively ineffec- 
tive were improving the performance of staff, obtaining qualified staff, fost- 
ering high staff and student morale, and coordinating the development of 
school goals. While absolute ratings need to be considered (assessments on all 
were at least slightly effective), each of these matters could provide a focus 
for more detailed research attention and district intervention. 

Although the elementary and junior high studies of teachers and area 
superintendents were geographically more confined than the studies of prin- 
cipals, some noteworthy conclusions about effectiveness of schools and prin- 
cipals should be mentioned. Regarding teachers’ assessments, junior high 
schools were rated as less effective than elementary schools, overall and on 
most criteria examined—markedly so for staff morale, stability of staff, and 
development of students’ social skills. The effectiveness of principals was 
rated likewise, and junior high schools were lower on all of the conventional 
school effectiveness dimensions mentioned earlier in this article. This gener- 
al pattern is consistent with other findings about junior high school teachers’ 
work attitudes (Holdaway, 1978; MacQueen & Ignatovich, 1986) and may be 
associated with the demanding nature of teaching adolescents (see Burlin- 
ghame, 1988; Kratzmann et al., 1980). 

In contrast, area superintendents regarded elementary and junior high 
schools as being of equivalent overall effectiveness, and no pattern of school- 
level differences was apparent in the area superintendents’ responses about 
effectiveness in specific domains. Moreover, area superintendents rated ju- 
nior high school principals marginally higher than elementary school princi- 
pals overall; and their ratings of effectiveness on individual criteria indicated 
that principals at the two levels were superior in different areas of work. 

With regard to perspectives of the three constituencies (principals, teach- 
ers, and area superintendents), two patterns are important. First, those 
directly responsible for school success tended to rate global and criterion 
effectiveness highest; those furthest removed from the school’s operation, 
the area superintendents, rated schools and principals lowest. Second, re- 
spondents tended to rate effectiveness higher in domains of personal respon- 
sibility; that is, teachers in classroom matters and principals in relation to 
leadership effectiveness. These findings highlight the need for multiple-con- 
stituency studies and appraisals of effectiveness, as organizational effective- 
ness writers from other disciplines have advocated (Cameron & Whetten, 
1983; Hasenfeld, 1983; Scott, 1987). 

Teachers in elementary and junior high schools regarded all of the spec- 
ified criteria as important to some degree for judging the effectiveness of 
schools. The two groups also displayed similar views about the relative 
importance of most criteria. The criteria proposed by area superintendents 
were somewhat different, although most of those respondents considered 
that equivalent criteria could be used to appraise effectiveness at all educa- 
tional levels. These findings further support the broad views of effectiveness 
espoused in organizational effectiveness literature from neighboring dis- 
ciplines and cast doubt on the adequacy of some school effectiveness re- 
search. 
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Conclusion 

Although the three studies examined here each investigated job satisfaction 
and effectiveness in a specific organizational context, as elements of a larger 
program of inquiry they contributed to systematic expansion of knowledge in 
these domains of organizational behavior and attitudes. Comparison of the 
three school levels in one educational region produced evidence of some 
substantial differences as well as organizational similarities, giving rise to 
recommendations for specific research attention and practical intervention. 

Likewise, comparative research of this kind should be appropriate for 
several other administrative and organizational attributes of schools such as 
culture and climate, stress, motivation, program development, and personnel 
and program evaluation. Moreover, school level represents only one of the 
available classifications—organization type is another. In Britain and 
Australia especially, government and independent/private schools represent 
fertile ground for comparison. Already, isolated studies (e.g., Weidenman, 
1989) have disclosed important differences in the behavior of government 
and independent school principals. 

Much comparative work remains to be done. Only by pursuing research in 
a systematic manner such as that described in this article can the pervasive 
and the distinctive attributes of various types of schools and administration 
be identified. Only then can understanding be advanced in a way that will 
highlight special needs and satisfy calls (e.g., Griffiths, 1988; Hallinger & 
Murphy, 1986; Holdaway & Millikan, 1980; Johnson, 1987; Kmetz & Wil- 
lower, 1982) for the development of theories to suit specific educational levels 
and organizational types. 
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EDITORIAL 


On Methodological Sects in Educational Research 


In past editorials I have commented on the need to view educational pheno- 
mena from a variety of perspectives. Not long ago I was asked to speak to a 
group of graduate students on the subject of paradigms in educational re- 
search. I was one of a three-member panel invited to address the topic 
“Paradigms in Educational Research: Help or Hindrance?” (Readers will 
notice that there is a colon and a question mark in the title. The former 
ensured academic credibility, the latter promised—falsely as it turned out— 
an openness to new ideas). In preparing for the presentation, I had another 
look at Kuhn’s (1970) Structure of Scientific Revolutions, glanced through 
three years of the Educational Researcher, resampled Scharnberg’s (1984) 
little monograph The Myth of Paradigm-shift, or How to Lie with Methodol- 
ogy. Here are some of the thoughts that these activities stimulated. 

No behavioral science has or has ever had a paradigm. A paradigm in the 
sense that Kuhn uses the term sets the entire research agenda. I cannot think 
of a time during this century when there has been a research agenda in 
education or in any other behavioral science. There are always several agen- 
das in operation, although at different times some have been more powerful 
than others. What are often called paradigms in educational research are 
really methodological sects. A sect is a body of people adhering to a particular 
faith. Members of different sects see themselves and other sects in quite 
different ways. Some sects are ecumenical and view differences among meth- 
ods as being various ways of finding common truth. Other sects are fun- 
damentalist and see methodological differences as leading to different 
truths—some better than others. Some sects make great demands on their 
followers (ritual is important and members are disciplined for sins against 
the revealed method), others make few demands. Some sects have been 
around a long time; other sects are recent. 

The membership in a sect is largely a matter of socialization, differential 
ability, politics, and inertia. The socialization usually takes place in graduate 
school, but there are some dramatic examples of rebirth at later times. 
Differential abilities encourage us to take different combinations of courses. 
These magnify and consolidate differences. In the broad field of education, 
proponents of new sects arrive on the scene like evangelists and convert a 
generation of new scholars to their view. Older scholars often have either too 
much momentum or too much inertia to adopt new ways. 

Unless you are a tree (and may be called upon to sacrifice your life for the 
good of a research report), methodological sects are probably good for educa- 
tional research. Ninety years of activity has only begun to raise our under- 
standing of educational phenomena. The complexity of the problems and the 
limits of individual creativity encourage the view that educational re- 
searchers working in groups from a common base of understanding and 
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method are likely to have more time to devote to investigation and reflection 
than eclectic freelancers. The challenge is to find ways for the sects to 
influence each other while maintaining their own integrity. This is certainly 
a role for the “generic journal” of educational research. 

The sects of educational research are based on method. As I edit this 
generic journal, I do so with the belief that the goals of educational research 
regardless of sect are to understand and to explain. Some of the central 
components are definitions, evidence, coherence, and credibility. Validity of 
method is not merely a matter of technique properly applied, it is an aid to 
credibility. Educational research is not value free. There seems to be a strong 
link between what is accepted as evidence and what is valued by the sect. 
Criticism is a key component to any line of research. It, and the anticipation 
of it, keep researchers honest. One role of criticism is to explore the value 
influences in inquiry. Another is to introduce alternative perspectives to the 
researcher. I view criticism and the review process as one of the services that 
AJER provides to the research community. It is intended to aid the cycle of 
integration, disintegration, and reintegration, not to quieten controversy and 
destroy motivation. Methodological differences are welcomed here, but the 
subscribers to AJER do not all share a common pool of methodological 
beliefs. To succeed in their communication task, contributors must take 
great care in responding to three important questions: What do I mean? How 
do I know? and Why is it important? A generic journal of educational re- 
search does not discriminate on the basis of the “sectual” preferences of its 
contributors. 


References 
Kuhn, T.S. (1970). The structure of scientific revolutions. Chicago: University of Chicago 
Press. 
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University of Alberta 


Issues in Education and Daily Living for 
Families of Children with Disabilities’ 


General Introduction 
Having only 30 minutes to talk about almost a decade of my research risks 
trivializing the research. Therefore, I will not discuss the part of my McCalla 
research on identifying work skills and preferences, employer expectations 
and attitudes, and the employee-organizational climate match to improve 
chances of long-term, successful employment for persons with mental dis- 
abilities. 

Following the instruction to make the presentation practical and applied, 
I have chosen to focus on the research basis for my McCalla book on families, 
particularly aspects of the research of direct relevance to educators and 
helping professionals. 

Dr. John Goodlad, Director of the Center for Educational Renewal at the 
University of Washington, spoke here in early April (Goodlad, 1990). He 
made the point that, in faculties of education, we have made our research 
inaccessible to practitioners. He argues for a collaborative partnership be- 
tween universities and the community, so that research is relevant to both. I 
have tried to reach that ideal in my research. 

If a first principle of my research has been collaboration with my col- 
leagues and my co-researchers, the parents, then a second principle has been 
practical relevance and social need. With that principle in mind let me talk 
about: 


The Social Context of the Research 

Current social and political attitudes and actions impact on individuals with 

disabilities and their families. To illustrate: 

1. In 1987, the Alberta legislature proposed a new education act containing 
a clause which would have declared some individuals with severe dis- 
abilities to be noneducable. This would have denied their right to an 
education. Fortunately, strenuous lobbying efforts resulted in removal of 
that clause (Bisanz, Snart, Sobsey, & Wilgosh, 1988). 

2. The relatively high incidence of physical and sexual abuse of persons with 
disabilities (Sobsey, 1989), both in institutions and in the community, is a 
cause for serious concern. 

3. Thecurrent trend toward deinstitutionalization of individuals with severe 
disabilities, without adequate social and community supports, is resulting 
in denormalization of families (Pivato, 1986). 


Lorraine Wilgosh is a professor in the Department of Educational Psychology. Her research 
interests include families of children with disabilities and career education for individuals 


with disabilities. 
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4. The trend toward integrating students with disabilities into the regular 
classroom has serious implications for the teaching profession and for 
teacher education, because of failure to prepare all regular classroom 
teachers to teach children with special needs (Wilgosh & Yewchuk, 1982). 

5. The goal of full integration into society of persons with disabilities is 
thwarted by lack of independent living alternatives for adults with dis- 
abilities, by negative employer attitudes toward hiring such individuals, 
and by high rates of unemployment for persons with disabilities due to 
inadequate vocational preparation (Fagan & Jenkins, 1989). 

While individuals with disabilities may represent only about 10% of our 

Canadian population, we cannot ignore their plight. 


Background and Procedures 
Featherstone (1980), an educator and parent of a child with severe dis- 
abilities, made the following comment in support of qualitative research on 
families of children with special needs. 


Even if their findings were not buried in dusty monographs, professional 
writers too often adopt a style that submerges feelings and problems beneath 
a deluge of aggregated statistics. Perhaps their conclusions inform the vision 
of social psychologists; they rarely touch the lives of ordinary people strug- 
gling to understand their own marriages.-(p. 93) 


Ramey, Krauss, and Simeonsson (1989) commented on the increasing 
documentation of diversity in family adaptation for families with an atypical 
member. Family values, beliefs, social resources, and experiences significant- 
ly affect parenting stress and family adjustment. The authors acknowledged 
the need to combine both qualitative and quantitative research methods in 
studying families. 

Although the research I am presenting today is qualitative in nature and 
I acknowledge Featherstone’s concern, I strongly support the position of 
Ramey and her associates on the value of both quantitative and qualitative 
research in increasing our understanding of families with children who have 
special needs. 

The experiences that have directly shaped this research can be traced 
back to the early 1980s when one of my students, Melody Barry, interviewed 
parents and teachers about vocational training for adolescents with mental 
disabilities. The teachers believed that the parents were unrealistic about the 
needs of their children and uninterested in their children’s education. On the 
other hand, the parents were frustrated and angry about lack of communica- 
tion and support from the teachers. Melody’s master’s thesis (Barry, 1983) 
concluded that, while there was a lack of communication between parents 
and teachers, both groups viewed the vocational needs of adolescents with 
mental disabilities in similar ways, a basis for parent-teacher cooperation 
and collaboration in the education of young people with special needs. 

At about the same time, a major review was published by Gallagher, 
Beckman, and Cross (1983), identifying a shortage of knowledge and under- 
standing on the part of professionals about families of children with dis- 
abilities. They called for research to focus on the family as the unit of study 
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and for a “families” focus in the training of teachers and other helping 
professionals. 

From that starting point, it seemed important to talk to parents of 
children with disabilities, to learn more about their lives, how they have 
coped with raising children who have special needs, and about issues of 
greatest concern to them. I began to interview parents of children with 
special needs, and, over a number of years, worked with some of my students 
and colleagues to gather information from a number, now totalling over 80 
families, of children with disabilities: learning disabilities, moderate intellec- 
tual impairments, severe disabilities, visual and hearing impairments. The 
children’s ages ranged from preschool to the late adolescent years. While 
some of the families were single-parent families, the majority were two-par- 
ent homes. One or both parents participated in the interviews, at the parents’ 
convenience. We asked them to tell their own story about being parents of a 
child with a disability, as they recalled their experiences. Therefore, the 
experiences may have been colored by subsequent events and the passage of 
time. However, we did not try to direct the parents’ stories, allowing them to 
focus on what they thought was most important, assured of confidentiality of 
all personal identifying information. 

Qualitative procedures (Becker, 1986; Colaizzi, 1978; Giorgi, 1975, 1985; 
Polkinghorne, 1983) were used to extract the major themes from the parent 
interview protocols. Input from different groups of families was analyzed by 
different researchers to eliminate bias in identifying themes. The themes 
were then verified with the co-researchers, that is, the parents. 

I will describe briefly the six major themes common to the families regard- 
less of the child’s disability, using typical parent quotations to demonstrate 
some of the richness and variety within the themes. It is difficult to select 
only a few aspects from a large body of information. In fact, I hope I raise 
more questions than I answer to increase general awareness toward these 
families and their needs. 


The Themes 
Theme 1: Coping with the Knowledge of the Child’s Disability 


Some parents learned quickly about their child’s disability because a physi- 
cian identified the disability soon after birth. According to Glendinning 
(1983), parents may find it easier to come to terms with the disability when 
they find out early, because they can then set realistic expectations for their 
child. However, more often parents sensed that something was wrong long 
before they were told that they should be concerned. One mother recalled the 
day that she found out: 
The doctor came into my hospital room and said, “Your child is blind.” He 
didn’t even ask me to sit down so he could tell me some bad news. I wasn’t 
impressed with his manner, and neither were the other mothers in the room. 
He was muttering words that were about six inches long, and I didn’t under- 
stand what it was all about. 
Many other parents told us how they found out in similar insensitive ways, 
including by telephone. 
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out and look for support systems and I think services should be easier to 
locate. 


The families may need direct support and assistance with caregiving. 
They may also need emotional support from professionals and parent sup- 
port groups to help them deal with personal and family stresses. Parents 
want to have a parent-professional partnership of open discussion with 
knowledgeable and caring professionals who take the time to listen to them. 


Theme 3: Advocacy and Parent Initiative 


Many of the parents believe that their child’s educational and other oppor- 
tunities have been determined through the parents’ initiative and have a 
sense of mission, direction, or purpose as a by-product of their efforts to raise 
their child. This takes its toll on the family by taking yet more of the parents’ 
time and energy. Two examples will illustrate the advocacy role: 


We have had to educate a lot of people. Number one, teachers must start 
listening to parents that are behind the kids. We live with these kids 24 hours 
a day, and we know them. My 10-year-old son is labeled “mentally hand- 
icapped.” I have labels for my other children as well: my oldest child is labeled 
“sifted,” and my seven-year-old is labeled “overweight.” We’re all labeled but 
don’t just accept the labels; ask “Who is the person behind the label?” 


eK 
I really didn’t know, when I became a mother, that I was supposed to be a 


warrior. I thought, “nurture,” not “battle.” Yet as I am talking, I am aware 
that I need to fight if I want things to change. 


Theme 4: Parents Search for the Best Educational Opportunities 


Parents believe strongly that an excellent teacher leads to school success for 
children with special needs, and that both the personal and professional 
qualities of the individual teacher are important to success: 


The teacher has done a tremendous job. She has unlimited patience and kids 
like Adam need patience. If she feels frustrated she never lets it show. She 
just keeps at it until it clicks for him. He can talk to her about anything and 
really feel comfortable; it’s almost like having another mother at school. 


Generally, parents appreciate honest and open feedback from teachers and 
want to be actively involved with the school. From a father: 


To educators, I would say, “Listen to the parents more. I think in most cases 
the parents know, but you have to be willing to listen.” 


Contrast the positive experience with these: 


They reluctantly took him into a regular class and said they would do some 
accommodating. Their idea of accommodating was to remove so many reces- 
ses over the years and made him stay in and do extra work that it was bizarre. 


26 6 


We are out of date in our educational system, still hung up on labeling a child 
as handicapped. But labeling a child as mentally handicapped is not going to 
help us to develop an educational plan for the child. I think the local com- 
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munity school is the most sensible school if you integrate, but the principal 
told me right from the beginning that schools were for regular kids, not for 
special needs kids. I decided that people in regular education are not prepared 
for special needs kids. 


Children with disabilities have special needs which must be considered by 
the schools. Foremost, they need sensitive and competent teachers who can 
respond to each child’s needs, who don’t reject the child because of the 
disability and can work in cooperation with the parents. Parents feel angry 
and frustrated because of difficulty finding good programs. For some child- 
ren, attending special programs requires long hours of travel. In other cases, 
programs may not exist because of high costs, or because teachers have not 
been trained to teach children with special needs: 


They require speech therapy and this is the thing they’re not getting. The 
schools claim there’s not enough money in the budget, but they’re going into 
all these other languages in the school and yet they’re not even teaching these 
kids to speak English. 


Parents are divided on whether integration or special classes are good or 
bad depending on their personal experiences with either, as these contrasting 
examples show: 


They increased her resource room time and that turned out to be more 
disruptive than good for her. Presumably she is making some progress, but 
they take her out for 20 minutes and she comes back in and she’s supposed to 
catch up with the other kids in the classroom. She said when she asked the 
other children what they were doing they would say, “Go away. You’re just 
trying to copy,” so she stopped asking. 


2K 2K OK 


Bob is only one year behind in some of his subjects and so the teachers are 
saying, “We really would like to put this child in a regular classroom soon.” 
Bob has worked tremendously hard to get where he is. When he was told he 
might go into the regular social studies class, he was devastated. It’s a whole 
different ball game to sort out the noise from 24 or 30 kids as opposed to 10 
kids. He still can’t copy from the board. I feel as if he’s being punished because 
he’s succeeded. They’re not putting my kid in a regular classroom. 


People writing about successful integration (e.g., Hanline & Halvorsen, 
1989) say that it requires flexible teachers with positive attitudes and ade- 
quate training who keep up ongoing communication with the parents. 

Some of the parents feel strongly that teacher education isn’t doing a good 
job of preparing teachers to teach children with special needs. Here are some 
parents’ viewpoints: 

There is such rampant ignorance. There’s so much that teachers want to 

ignore. There is so much putting down of parents. Parents really know their 

kids. We do get overanxious but I don’t think we should be put down for it. 

Four years of education, or six, or eight, just don’t teach them anything about 

individuals. 

2K 2K 
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I think the teachers are not generally qualified to deal with a learning 
disabled child in the classroom as integration requires. And I don’t think it is 
good to take the children out temporarily and put them in a resource room 
with a special education teacher. But if regular class teachers are not equip- 
ped to diagnose the problem and to individualize instruction, then we have no 
choice but special classes. 


2F EK 


The teachers don’t recognize learning problems so how can they deal with 
them. Universities are not giving them the tools to work with children with 
handicaps. I really feel that teachers should have an introduction to children 
with handicaps as part of their program at university. If you are looking at 
one in 10 children with a learning problem, I think the school system needs 
trained teachers to recognize and teach these kids. 


Some parents have told us about very good experiences in schools, but 
more parents have had many poor experiences in the schools. They identify 
problems with both special and regular classes and poor teacher education as 
the causes. 


Theme 5: The Family and the Child with Special Needs 


Parents tell us that having a child with special needs has negative effects on 
family life because there is more stress on the family. Parents have responsi- 
bility for the child and for choosing services and education programs, and 
many parents feel guilty because this may leave less time and energy for 
other family relationships. However, there can also be strong positive effects 
on family life from the presence of a child with special needs. 

The more severe the child’s disability, the greater the demands seem to be 
on all family members, as in this example: 


You have to readjust your whole lifestyle around the schedule of the severely 
handicapped child. You can’t go on holidays; you can’t take him out to a lot of 
places. It not only affected my husband and me, but it also affected our 
relationship with our daughter and a lot of things she could and could not do. 
And we found that, when our younger son was learning how to walk, we had 
to say, “Be careful, you’ll step on Carl.” We were protecting Carl and at the 
same time restricting David. And with our daughter being older, a lot of 
responsibility fell on her from a very young age. 


Family conflict was not the main focus for most of the families that we 
interviewed and was not mentioned by many families, who carry on as well 
as they can. Featherstone (1980) wrote that the child’s disability amplifies 
problems in ordinary families and makes the parents more vulnerable to 
stress and conflict, particularly when they are feeling tired, afraid, or angry. 
She suggested that the parents may need help to disentangle their feelings 
about the disability from their feelings about their marriage, and that, al- 
though some marriages will fail, most will survive. She found parents to be 
unwilling to talk about marital conflicts, although some felt more free to talk 


after divorce. This may explain why so few parents talked to us about such 
conflicts. 
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Feelings of depression, loneliness and isolation, sadness, hopelessness, 
and guilt are common in such families, yet these families are basically normal 
families which show both successful and maladjusted behaviors, according to 
Yura (1987). Parents often express regret about the effects on their other 
children of having a child in the family who has a disability. However, it 
seems that most families function adequately, and the outcomes for the other 
children can range from very little effect to very positive or very negative 
effects (Yura, 1987). This example illustrates the positive aspects: 


Having a brother with a learning disability has taught his sister tolerance. In 
that respect it’s added to the strength of our family life. The kids have gained 
values well beyond their years, that will stay with them forever. 


Families with children who are disabled tend to be more isolated from 
their neighbors, relatives, and friends than are ordinary families, with the 
result that the child with a disability has little opportunity to develop impor- 
tant social skills and friendships. Isolation increases with adolescence (Yura, 
1987). This is one important area where the children and their families need 
assistance. Teachers and other professionals must educate nonhandicapped 
children so that acceptance of, and friendships with, children who have 
special needs will develop naturally. Support groups and families must assist 
these children to develop hobbies and recreational interests so that they don’t 
spend long, lonely hours alone watching television. 

I want to make the point here that sexuality education rarely came up 
with our co-researchers, nor did the topic of physical and sexual abuse. Likely 
sexuality education was less important to these families at this time, given 
their other major concerns. Regarding physical and sexual abuse, and per- 
haps sexuality as well, the parents may not be able to discuss the issues, or 
the issues may not have touched this sample of parents. 


Theme 6: What are the Future Prospects for the Child 


For all children the ideal future likely is a successful, independent life. 
However, for children with a disability, the parents’ dreams about the future 
may be quite different from this ideal. Parents may feel optimistic about 
adult independence for children with mild disabilities. On the other hand, 
parents worry about the work training, community employment, and inde- 
pendent living opportunities for children who have moderate to severe dis- 
abilities, particularly when the parents are too old to be the primary 
guardians for their children, as this example shows: 


Hopefully he’ll be able to do a job that he will get paid for. I don’t want him in 
a sheltered workshop, or an institution or activity center. I want him in the 
real world, doing real things with real people, living, learning, playing, work- 
ing. I can’t see far enough ahead to know if he will be able to live inde- 


pendently. 


Conclusions and Implications 
As indicated earlier, there are limitations to this type of research. Parents 
recall experiences selectively. Featherstone (1980) suggested that parents’ 
reports tend to be biased toward the positive. Such problems as marital 
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conflict or sexual abuse may be too personal to share or may be outside of the 
experience of the families. The process of verifying the themes with the 
co-researchers does not get at aspects they have omitted. Their reports of 
sibling experiences and of experiences of persons with disabilities cannot 
replace direct study of the individuals’ own experiences. Lack of control 
groups and small sample sizes can be criticisms of the research; however, 
there is a great deal of consistency among the families on the major themes, 
and the nature of many of their experiences is beyond those of the normal 
family. 

Among the needs identified by the families is training for medical and 
other helping professionals and teachers to avoid the lack of sensitivity and 
lack of knowledge experienced. For example, it is important to recognize the 
ongoing nature of the stress and other feelings experienced by these families 
and the absence of stages of adjustment. Counseling them to expect to go 
through stages from grieving to acceptance is too pat an answer to their 
needs. 

Many of the families show great strength in coping, but there is a need for 
adequate supports. The parents want to be accepted as partners in the task 
of providing for the development of each child’s full potential. 

Many of the parent comments on teaching and teacher education must be 
given serious consideration by educators. These parents have described the 
qualities of good teachers. They value teachers who show respect for the child 
and focus on the child, not on the disability. They also value teachers’ 
qualities of patience and sensitivity, dedication, knowledge, and understand- 
ing of the needs of the child and family. Generally, the parents that we 
interviewed appreciate honest and open feedback from teachers and want to 
be actively involved with the school. 

The parents have indicted teacher education for failing to prepare teach- 
ers to teach children with special needs. With integration becoming a reality 
in our schools, we cannot ignore the need to provide all student teachers with 
special education training, particularly in assessment and development of 
individual program plans, in classroom management, and in parent-profes- 
sional relationships. 


Note 


1. This article is based on a presentation by the author at the conclusion of her year as a 
McCalla Research Professor in the Faculty of Education at the University of Alberta. The 
contributions of the following individuals at different stages of the research are 
acknowledged: graduate students—M. Hopchin (Barry, 1983); K. Waggoner (Waggoner, 
1988); B. Adams (Adams, 1987); M. Morgan-Redshaw (Morgan, 1987); K. Hancock 
(Hancock, 1988); S. Chomicki; colleagues—M.A. Bibby, D. Sobsey, and L. McDonald; and 
co-researchers—the families whose collaboration made the research possible. 
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Early School Entry and Subsequent 
Academic Problems 


Learning impaired students and a comparable age and sex matched, normal- 
achieving control group (in grades 5-8) were categorized according to the time of 
the year of their birth. Those born in the last quarter of the year (October-Decem- 
ber) and who started school early as indicated by school records were defined as 
early entry (EE) students. The groups were evaluated for incidence of EE students, 
achievement, and self-esteem. Results revealed (a) that the incidence of EE stu- 
dents was higher than expected in the group of learning impaired students, (b) 
that the academic achievement of EE students was low in the control group, and 
(c) self-esteem of the learning impaired was higher for EE students, but only for 
those who had not been formally labeled as learning disabled. Formal labeling 
may impact negatively on self-esteem. Alternatively, those learning impaired stu- 
dents who are not suffering from self-esteem problems may not be moved further 
along the identification process to be formally labeled as learning disabled. Those 
children who are moved further along in the identification and placement process 
tend to have lower self-esteem and be early entry students. Educational tmplica- 
tions and recommendations are discussed. 


School entry age is a variable that may inadvertently influence referral, 
diagnostic, and remedial practices in education. It is possible that children 
could be considered learning impaired when in fact they are merely young 
(i.e., chronologically immature). Such incorrectly identified academic dif- 
ficulties could then contribute to self-esteem problems and to future academ- 
ically related problems. Thus schools, though attempting to help, could 
actually be involved in causing additional academically related problems by 
teaching curriculum material prematurely to the normal but young child. 
The research literature suggests that early school entry is a potentially 
serious problem in both referral practices and the identification of learning 
disabilities. For example, DiPasquale, Moule and Flewelling (1980) con- 
ducted a study of children referred for psychological assessment (N=552, in 
grades kindergarten through 13) in York County, Ontario. They noted an 
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increased incidence of referrals for early entry students. Further analyses 
revealed that the effect was evident for the primary grades only, for males 
only, and for academic referrals rather than behavioral referrals. Primary 
teachers apparently perceive early-entry males as predisposed to learning 
problems. 

Moreover, it seems that psychologists also may be attributing learning 
disorders to early school entry. In a study of children identified as learning 
disabled, mentally retarded, and emotionally disturbed, Maddux, Green and 
Horner (1986) reported that the total group of special education students 
(N=291), enrolled in grades 1-12, consisted of a disproportionately large 
number of children who were young when they entered school. The students 
were classified as early entry or late entry using 6.5 years as the youngest age 
to be identified as late entry. The expected frequencies (based on census data) 
of early and late entry children were compared with observed frequencies to 
determine if entry age was a chance variable. A chi-square analysis revealed 
observed and expected frequencies differed significantly with more early 
entry students in the sample of 291 special education students than expected. 
Separate chi-square analyses were computed for all categories and were 
significant for the learning disabled and emotionally disturbed groups but 
not for the mentally retarded. These results were consistent with earlier 
research (Maddux, 1983; Maddux, Stacy; & Scott, 1981) and supported the 
notion that early school entry may place certain children at risk. 

Diamond (1981) conducted a similar study in a particular jurisdiction in 
Hawaii and found a relationship between the identification of learning dis- 
abled children and time of year of birth. The sample population of 118 
referred children showed significantly more of the 42 boys identified as 
learning disabled were born later in the year (i.e., July-December), with 
similar results for the 17 girls identified as learning disabled. 

In a more extensive study Diamond (1983) used a sample of 154,203 
students. These students, between the ages of five and 20, were examined on 
such variables as sex, birth date, and handicapping condition. A significant 
relationship was found between the percentage of children born later in the 
year and those classified as learning disabled, with more learning disabled 
children born late in the year. Furthermore, maturity was not a factor that 
alleviated the problem in later years. Thus it would appear that early-entry- 
induced problems could persist into the upper elementary grades. 

These studies would suggest that age is inadvertently affecting both 
referral and diagnostic practices with apparent long-term academic effects. 
Problems in the primary grades due to early entry may be actually generating 
subsequent real and long-term handicaps (Uphoff & Gilmore, 1985). 

The question of specific long-term effects related to early entry is empiri- 
cal. Both academic and self-esteem problems have been suggested as poten- 
tially relevant effects (Diamond, 1983; DiPasquale et al., 1980; Gredler, 
1980). Accordingly, the focus in the present study was on (a) specific 


academic areas (i.e., reading, mathematics, and language) and (b) self-es- 
teem. 
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On the basis of the existing research it was predicted (a) that the frequen- 
cy of early entry students would be higher than expected (i.e., beyond a 
chance distribution) in a group of randomly selected learning impaired stu- 
dents, (b) that achievement levels would be lower in all academic areas for 
early entry students, and (c) that self-esteem would be diminished for early 
entry students. 


Method 
Subjects 


The sample consisted of 108 students (in grades 5-8) who were chosen from 
five demographically similar (i.e., middle class) schools. The learning im- 
paired group contained 54 students (38 males and 16 females; mean 
age=12.86 years; SD=1.21) and represented the older learning impaired 
students (i.e., the junior and intermediate level resource withdrawal classes) 
of the five schools. Older learning impaired students were the focus as we 
were interested in long-term effects. A control group was constructed using 
an identical sex distribution and a similar age distribution (mean age= 12.60 
years; SD=1.15). The control group was drawn from the same grades from 
the same five schools to ensure similar ages and socioeconomic backgrounds. 
To obtain a control group with a similar age distribution, classroom teachers 
(blind to the purpose of the study) matched the learning impaired students in 
their class with normal achieving classmates of comparable age. Although 
the matching procedure was less than precise, the groups were equivalent in 
terms of age, ¢(106)=1.18, p>.2. Moreover, the matching procedure 
managed to capture 17 early entry students in the control group which was 
confirmed subsequent to group formation. The learning impaired group was 
composed of (a) learning disabled students who had been identified and 
placed through a system-level Identification, Placement and Review Com- 
mittee (IPRC) Process (V=18) and (b) children who had not been formally 
identified but were withdrawn for regular resource assistance on the basis of 
school-level identification processes (V=36). Preliminary tests indicated 
these two learning impaired subgroups were equivalent in terms of age, 
achievement, and self-esteem (see Table 1). Thus the learning impaired 
students were treated as one group initially. 


Test Instruments 


Achievement in reading, mathematics, and language was evaluated utilizing 
the Canadian Achievement Test (1981), a modified version of the California 
Achievement Test. All students in the sample had been administered the 
Canadian Achievement Test (CAT) in accordance with the board’s ad- 
ministrative practices. Unfortunately, the reliabilities for the CAT are not 
readily available. However, Salvia and Ysselydke (1981) report evidence for 
very good reliability and validity data on the California version of the test. 
They indicate that for total scores internal consistency coefficients generally 
exceed .85, and test-retest reliabilities are generally greater than .90. 
Self-esteem was determined using the Behavioral Academic Self-Esteem 
(BASE) rating scale (Coopersmith & Gilberts, 1982) which was administered 
and scored by school personnel. Teachers rate students on such scales as 
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Table 1 
Identification Data on Age, Canadian Achievement Test (CAT) Grade 
Equivalent Scores and Behavioral and Academic Self-Esteem (BASE) 
Percentile Ratings for the System-Identified and the School-Identified 
Learning Impaired Students 


System-ldentified School-ldentified 
Age 413.1400 (1:27) 12:70 CAT) 
Canadian Achievement Test 
Language 2.43 (0:56) 2.69 (1.41) 
Reading 2.19 (0.89) 2.60 (1.39) 
Mathematics B. ceen heb) 32a LOO} 
BASE 36.56 (22.10) 41.13 (21.80) 


Note: The two groups are equivalent on all measures at the .05 level. 


Student Initiative, Social Attention, Success-Failure, Social Attraction, and 
Self-Confidence. The BASE was developed to establish construct and predic- 
tive validity measures of school success. This is a teacher-impression rating 
form and the limitations of teacher ratings are acknowledged (Foster & 
Ysseldyke, 1976; Greenwood & Morton, 1989). However, interrater 
reliability was estimated as moderately strong (r=.71) when different teach- 
ers were utilized. No substantial differences in the relationships across the 
sexes was noted, and the resulting reliabilities of BASE ratings were .83 for 
boys and .84 for girls (Coopersmith & Gilberts, 1982). 


Procedure 


Students were classified in terms of four entry-age levels. The year was 
divided into four quarters consisting of three months each and children were 
identified as belonging to one of the four quarters on the basis of their month 
of birth. All students born in the last quarter (October, November, Decem- 
ber) formed the early entry group. These were the students who were four 
when they began formal schooling as confirmed by school records. All other 
students in the study were five when they began school. For grouping pur- 
poses the year was divided into four quarters rather than thirds or halves 
because our early entry students clearly fell into the last three months of the 
year. Frequency data were collected to determine the number of students 
born in each quarter of the year for the learning impaired and the control 
group. For the achievement test (CAT) the grade equivalent scores were 
analyzed first to provide rough estimates of the degree of lag due to any early 
entry effect. To determine relative standing, percentile scores (based on local 
norms determined by the test distributor) for three of the CAT subtests 
(language, reading, and mathematics) were collected for all students and 


analyzed. The BASE was administered with school personnel to evaluate 
each student’s self-esteem. 
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Results 
Frequency Data 


The distribution of learning impaired subjects born in each of the four 
quarters was evaluated using a chi-square analysis on the Birth Quarter 
(first, second, third, fourth) frequencies. A significant difference was evident 
in the distribution, chi-square (3)=19.92, p<.001. As predicted, in the group 
of learning impaired children a high percentage (50%) were born in the last 
quarter of the year and were early entry students. In the comparison group 
also a large percentage of the children were born late in the year (69% in the 
last two quarters), chi-square (3)=9.55, p<.05. However, this would be an 
artifact of our attempts to match subjects on age. Frequencies are reported in 
Table 2. 

Further analysis of the frequency data was undertaken after subtyping 
the learning impaired students into two groups (i.e., system-level identified 
and school-level identified). In the school-level identified group there was no 
significant early entry effect, chi-square (3)=5.58, p>.1. However, in the 
system-level identified learning impaired group there was a strong early 
entry effect, chi-square (3)=26.89, p<.001. As may be seen in Table 3 most 
of the students in the system-level identified group were born in the last 
quarter. Early entry in the learning impaired group seems to be influencing 
the system-level identification process. 


Achievement Test Data 


Grade Equivalent Scores. The means and standard deviations for the 
CAT grade equivalent scores are reported in Table 4. Analyses of variance 


Table 2 
Frequencies of Children Born in Each Quarter of the Year for 
the Learning Impaired Group and the Control Group 


Learning Impaired Control 
Time of Year of Birth 
First Quarter 6 12 
Second Quarter 13 5 
Third Quarter 8 20 
Fourth Quarter 27 17 
Table 3 


Frequencies of School-Level-Identified and System-Level Identified Learning 
Impaired Children Born in Each Quarter of the Year 
eee eS ee eee ee 
School-Level System-Level 


Time of Year of Birth Identified Identified 


First Quarter 5 1 
Second Quarter te 2 
Third Quarter 7 1 
Fourth Quarter 14 14 


Snr nnn nnn Eat EEtEEEEES a 
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(ANOVA) were computed for language, reading, and mathematics using 
Group (learning impaired, control) and Birth Quarter (first, second, third, 
fourth) as the independent variables. The sex distribution was insufficient to 
allow for sex to be entered as a third independent variable. 

An early entry effect was noted for the Language test and the Reading 
test. On the Language test the ANOVA indicated a main effect for Group, F 
(1, 99)=71.83, p<.001, due to superior performance by the control group, 
and a two-way interaction for Group and Birth Quarter, F(3, 99)=3.15, 
p<.05. As may be seen in Figure 1 all students in the experimental group 
showed low performance with no differences between the various quarters 
(p>.05). Analysis of simple effects (Winer, 1971) indicated that the interac- 
tion was due to students in the control group who were born in the last 
quarter, that is early entry control students. They were performing lower 
than their peers born in the first quarter, ¢(27)=2.22, p<.05, the second 
quarter, ¢(20)=2.77, p<.025, and the third quarter, ¢(35)=2.59, p<.025. 
Interestingly, they were approximately two years behind their peers, yet 
teachers had not referred them for school-level or system-level identification 
and remedial assistance. Perhaps teachers look for a more egregious lag 
before referring students for remedial assistance. Also of interest, a sub- 
sequent analysis indicated there was no difference between the school-level 
identified (mean=2.56; SD=0.55) and the system-level identified 
(mean=2.47; SD=0.65) learning impaired born in the fourth quarter, 
t(25)=0.39, p>.1. 

On the Reading test the ANOVA indicated a similar main effect for 
Group, F(1, 99)=51.67, p<.001, due to superior performance by the control 
group, and a two-way interaction for Group and Birth Quarter, F(3, 


Table 4 
Grade Equivalent Scores on the Subtests of the Canadian Achievement Test 
for Children Born in Each Quarter of the Year for the Learning Impaired 
Group and the Control Group 


Time of Year Learning Impaired Control 
Subject of Birth Mean SD Mean SD 
Language 1st Quarter 2iks 0.83 6.27 2.59 
2nd Quarter 2.62 Q:55 7.42 Oohe 
3rd Quarter 2.38 0.75 6.74 3.18 
4th Quarter 2.51 0.59 4.51 1.68 
Reading 1st Quarter 2.02 0.79 573 156 
2nd Quarter 2.29 0.78 (wwe 3.68 
3rd Quarter 2.69 bee a Sou 2.88 
4th Quarter 2.48 1.38 3.99 1.42 
Mathematics 1st Quarter lar ge) 1.48 5.57 1.84 
2nd Quarter 3.42 1.20 5.78 0.61 
3rd Quarter 3.46 1.09 5.49 1.93 
4th Quarter 3.19 0.80 4.88 1.56 
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MEAN GRADE EQUIVALENT SCORES 


FIRST SECOND THRD FOURTH 
TME OF YEAR OF BRTH BY QUARTERS 
Figure 1. Grade equivalent scores on the Language test for the learning impaired 


and control groups for those born in each quarter of the year, showing the reduced 
performance for those born in the fourth quarter in the control group. 
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Figure 2. Grade equivalent scores on the Reading test for the learning impaired and 
control groups for those born in each quarter of the year, showing the reduced 
performance for those born in the fourth quarter in the control group. 


317 


L.L. Morton and N.M. Courneya 


99)=3.57, p<.025. Again early entry students in the control group were 
performing below their peers in the first quarter, ¢(27)=3.10, p<.01, the 
second quarter, ¢(20)=3.05, p<.01, and the third quarter, #(35)=2.06, p<.05. 
They were approximately 1.5 years behind their peers. Subsequent analysis 
indicated there was no difference between the school-level identified 
(mean=2.72; SD=1.73) and the system-level identified (mean=2.26; 
SD=0.97) learning impaired who were born in the fourth quarter, 
f=(20)—O 7 dap e 

On the mathematics test no statistically significant early entry effect was 
evident. 

Percentile Scores. The means and standard deviations for the CAT per- 
centile scores based on local norms for each grade level are reported in Table 
5. Two-way (Group by Birth Quarter) ANOVAs were computed for language, 
reading, and mathematics. 

An early entry effect was noted for the Language test. The analysis of 
variance revealed main effects for Group, F(1, 99)=142.51, p<.001, and 
Birth Quarter, F(8, 99)=3.36, p<.025, which are qualified by a significant 
two-way interaction for Group and Birth Quarter, F(3, 99)=2.85, p<.05. As 
may be seen in Figure 3 all students in the experimental group showed low 
performance with no differences between the various quarters (p> .05). How- 
ever, in the control group those born in the last quarter (i.e., early entry 
control students) were performing less well than students in the immediately 
preceding quarter (i.e., the third quarter), ¢ (35)=2.06, p<.05, and students 
born in the first quarter, ¢(27)=2.56, p<.025. The detection of a statistically 
significant difference between early entry students and students born in the 
second quarter was precluded by the small sample size (V=5) for the second 


Table 5 
Percentile Ratings Using Local Norms on the Subtests of the Canadian 
Achievement Test for Children Born in Each Quarter of the Year for the 
Learning Impaired Group and the Control Group 


Time of Year Learning Impaired Control 
Subject of Birth Mean SD Mean SD 
Language 1st Quarter 6.17 6.18 59.75 19.39 
2nd Quarter Wares 20.20 59.20 23.18 
3rd Quarter 7.00 6.30 62.10 23.70 
4th Quarter 9.23 7.62 41.06 19.39 
Reading 1st Quarter 8.50 7.89 61.75 12ine 
2nd Quarter 17.23 19.80 71.20 27.93 
3rd Quarter 1Sht3 13.21 58.65 25.68 
4th Quarter 15.77 18.49 47.18 25.28 
Mathematics 1st Quarter 7.33 12.26 61.33 24.62 
2nd Quarter 17.00 19.62 61.20 25.17 
3rd Quarter 11.50 11.19 65.55 20:25 
4th Quarter 14.08 13.02 52.00 25.10 
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MEAN PERCENTIEE SCORES 


| 
FRST SECOND THRD FOURTH 


TME OF YEAR OF BRTH BY QUARTERS 


Figure 3. Language percentile scores for the learning impaired and control groups 
for those born in each quarter of the year, showing the reduced performance for 
those born in the fourth quarter in the control group. 


quarter students. On the analyses for reading and mathematics, Birth 
Quarter was not a significant factor using local percentile scores. 


Self-Esteem Data 


The means and standard deviations for the BASE ratings are reported in 
Table 6. The two-way (Group by Birth Quarter) ANOVA revealed no main 
effects but there was a Group by Birth Quarter two-way interaction, F (3, 
99) =4.57, p<.01. Simple effects tests (Winer, 1971) indicated group differen- 
ces in the first three birth quarters (p<.05) but not in the fourth quarter 
(p>.05). Furthermore, although there appear to be differences in the control 
group between quarters (see Figure 4) the simple effects tests revealed that 
in the control group there were no differences in ratings between subjects in 
each quarter (p>.05). However, in the learning impaired group students 
born in the fourth quarter (i.e., early entry students) received significantly 
higher ratings than students born in the first quarter, ¢(31)=2.19, p<.05, 
students born in the second quarter, ¢(38)=2.13, p<.05, and students born in 
the third quarter, ¢(33)=2.14, p<.05. This interaction effect is presented 
graphically in Figure 4. 

Further analysis of the self-esteem ratings for the system-level identified 
and school-level identified learning impaired students who were born in the 
fourth quarter indicated that the school-level identified learning impaired 
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PERCENTILE SCORES 


FIRST SECOND THRD FOURTH 


TME OF YEAR OF BIRTH BY QUARTERS 


Figure 4. BASE rating percentile scores for the learning impaired and control 
groups for those born in each quarter of the year, showing the higher ratings for 
early entry learning impaired students. 


Table 6 
BASE Percentile Ratings of Children Born in Each Quarter of the Year for the 
Learning Impaired Group and the Control Group 


Learning Impaired Control 
Time of Year of Birth Mean SD Mean SD 
First Quarter 26.8 10.9 48.5 29.6 
Second Quarter 34.3 16.8 66.8 27.6 
Third Quarter 29.0 18.1 56 :2iereo.o 
Fourth Quarter 48.0 229 Sisco 


had higher esteem ratings (mean=60.92; SD=14.43) than the system-level 
identified learning impaired (mean=36.00; SD=23.22), t(25)=3.32, p<.01. 


Discussion 
Incidence 


The findings revealed a high frequency of early entry students in the learning 
impaired group supporting the prediction that early entry students are at 
risk of being labeled as learning impaired. In fact, 50% of the students in the 
learning impaired group were born in the last quarter of the year. Such a 
finding is consistent with previous research (Diamond, 1983; DiPasquale et 
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al., 1980; Maddux et al., 1986) reporting a higher incidence of early entry 
students amongst those referred for psychoeducational evaluation and those 
identified as learning impaired. In the present study the subsequent analyses 
indicated that the effect was evident for the system-level identified rather 
than the school-level identified learning impaired. This indicates that many 
children targeted for remedial assistance and then referred on for formal 
identification are in fact early entry students. 

Because the learning difficulties of early entry students may be a function 
of age rather than some specific disorder, remedial interventions may not be 
the most appropriate. Alternate interventions such as (a) curriculum reduc- 
tions, (b) the insertion of another year in the primary division, (c) double 
entry dates, and (d) delayed entry might be more appropriate. Moreover, the 
question concerning what would be the best intervention is considered to be 
a timely question in terms of current Canadian educational concerns. 


Achievement 


When grade equivalent scores were used to provide estimates of actual 
performance levels, an early entry effect was evident for both language and 
reading, with students born in the fourth quarter showing significant perfor- 
mance lags. However, the effect was noted on the control group, not the 
learning impaired group. Possibly a floor effect obscured differences in the 
learning impaired group as all of the learning impaired were showing poor 
academic performance. | 

The depressed performance in the control group suggests an overall 
general language delay is more prevalent with early entry students. Thus 
general and persistent language related problems may be typical of early 
entry even after five or more years of schooling, supporting Diamond’s 
(1983) contention that the problems are not ameliorated with time. 


Self-Esteem 


A particularly salient finding emerged from the self-esteem data. In the 
learning impaired group it was noted that the early entry learning impaired 
students (i.e., those born in the fourth quarter) received higher BASE ratings 
than the regular entry learning impaired students. It appeared that early 
entry students were not suffering from the same self-esteem problems that 
characterized the regular entry learning impaired. However, when the find- 
ings for the system-level identified and the school-level identified learning 
impaired early entry students were compared, it was evident that it was the 
school-level identified learning impaired who had the higher self-esteem 
ratings. The system-level identified learning impaired had lower self-esteem 
ratings. Perhaps being labeled as learning disabled contributes to self-esteem 
problems in early entry children. Alternatively, it is the students with self-es- 
teem problems in addition to academic problems who are referred on to the 
system-level identification, placement, and review process. Clarification of 
the relationship between self-esteem, referral, and labeling should be careful- 
ly addressed in future research. 

The low self-esteem ratings for the learning impaired regular entry stu- 
dents and the system-level identified learning impaired early entry students 
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suggest that the learning impaired generally have serious self-esteem 
problems in addition to academic problems. In view of the potentially detri- 
mental effect of low self-esteem, this area should be a primary focus in 
remedial programs. 

Limitations in the present study relate to the small sample size in each of 
the four quarters, the subjectivity in BASE ratings, and the quality of the 
standardized test (i.e., the CAT). In addition, although we were looking for 
long-term effects of early entry, there are grade limitations in the study (ce., 
the use of grades 5 to 8 while excluding the primary grades). The effect of 
early admission may be more pronounced in the primary grades. For ex- 
ample, it has been noted by Maddux et al. (1986) that the disadvantage of 
early entry becomes progressively less important as children move through 
the grades although Diamond (1983) does not concur. A more comprehensive 
study should address all grade levels. 

Regardless of the limitations, the present findings clearly indicate that 
there is an abnormally high incidence of early entry students in groups 
targeted as learning impaired, and many of these students are referred on for 
system-level evaluations, possibly on the basis of problems related to self-es- 
teem. Furthermore, even children who are not identified as learning im- 
paired—yet are early entry students—show language related delays that 
appear to persist into the upper grades of the elementary school setting. 
Perhaps the most intriguing finding, however, relates to self-esteem. In the 
group of children identified as learning impaired those who were early entry 
students and manifested lowered self-esteem tended to be formally identified 
as learning disabled, whereas those who did not appear to be suffering from 
reduced self-esteem remained as school-level identified learning impaired 
students. Further exploration of this self-esteem difference in the identifica- 
tion process would appear to be warranted. It may be that referral practices 
are also under the influence of perceived problems in self-esteem, at least for 
early entry students. 

Finally, the present research supports the contention that ways to 
ameliorate the early entry effect are needed. Thus future research should 
address the viability and efficacy of solutions such as delayed entry, double 
entry dates, extended primary programs, and individualized programming. 
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Student Dropouts and Scheduling 
Patterns in Secondary Schools: 
An Exploratory Study’” 


This empirical, archival study investigated the relationship between dropout rates 
and secondary school scheduling patterns in a purposive sample of Ontario public 
school boards using pairs of schools matched on internal and external environ- 
mental conditions. The scheduling patterns compared were full-credit half-year 
semestered schools and full-credit full-year traditionally organized schools. One 
plausible rival explanation for differences in dropout rates was measured and 
found to be significantly related to dropping out. A statistically significant rela- 
tionship was found between dropout rates and scheduling patterns but, when the 
relationship was examined controlling for the academic-level of a student’s pro- 
gram, an interactive effect was revealed. If dropout rates are a measure of school 
effectiveness, it might appear that one form of scheduling contributes more effec- 
tively than the other to keeping certain types of students in school. 


Dropouts are attracting increased attention from the Ministry of Education 
and from Boards of Education in Ontario as the relevance of education comes 
under close scrutiny. Radwanski (1987) wrote: “It is therefore the finding of 
this [his] study that the high school dropout rate in Ontario, as precisely as it 
can usefully be identified, is between 31 and 33 per cent” (p. 70). Citing other 
studies, Radwanski listed factors related to dropping out, namely, socio- 
economic status, family structure, high school streams, academic perfor- 
mance, part-time work, ethnic background, and psychological character- 
istics. General causes of dropping out were reported to be school-related 
reasons, which outweighed work-related reasons which outweighed personal 
reasons. He claimed the importance of school-related reasons might be un- 
derstated because citing some other reason for dropping out implied that 
school was not attractive or satisfying; school factors were implicit in any 
decision to drop out. Specific school-related reasons were general dislike, lack 
of interest, boredom or (ir)relevance, academic weakness or difficulties, sys- 
temic rejection, and perceived indifference among educators. Radwanski’s 
report implied that schools were not being effective in meeting students’ 
needs. 

Using dropout rates as a measure of school effectiveness has been sup- 
ported empirically. Rutter (1983) wrote: “whether or not they [students] 
continue in some form of education after reaching the end of compulsory 
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schooling ... has been used as a school outcome measure in several studies of 
school effectiveness” (p. 11), and Squires, Huitt, and Segars (1983) report the 
use of dropout rates as outcome measures of school effectiveness (p. 48). 

A major change in school scheduling patterns has been occurring throug- 
hout Ontario during the last two decades; namely, a widespread shift to the 
semestered pattern of scheduling schools. “About one-third of Ontario’s 597 
secondary schools were operating on the semester system during the school 
year 1981-82 compared with only 24 schools ten years earlier” (Raphael, 
Wahlstrom, & McLean, 1986, p. 36). By 1986, that fraction had grown to over 
two thirds (Ministry of Education, 1986) and now likely exceeds three 
quarters. The incidence of semestering in Ontario varies regionally, with 
northern regions having higher semestering rates than southern regions. 

The scheduling pattern in a secondary school—either full-credit full-year 
courses (traditional scheduling) or full-credit half-year courses (semestered 
scheduling)—could influence many aspects of teaching and learning. Purkey 
and Smith (1982) view school systems “as ‘nested layers’ in which each 
organizational level sets the context and defines the boundaries for the layer 
below” (p. 64). School characteristics affect classrooms; scheduling patterns 
are school-level phenomena. Ross (1977) reported that “the popularity of 
semestering is founded on the belief that this organizational change makes a 
positive contribution to the emotional, social, and cognitive growth of stu- 
dents” (p. 23). School characteristics affect learning. Raphael et al. (1986) 
reported that the benefits of semestering over traditional scheduling pat- 
terns included opportunities for (a) greater flexibility in course offerings and 
selection, (b) student acceleration, (c) more varied pedagogy, and (d) im- 
proved student study habits, motivation, and achievement. Many of those 
claims by Ross (1977) and Raphael et al. (1986) must yet be validated, and 
semestering—and any unintended consequences it seems—has been imple- 
mented with few questions being asked. King, Clements, Enns, Lockerbie, 
and Warren (1975), however, reported that a number of small-scale studies 
showed that students and teachers who had experienced semestering were 
more supportive of it than opposed to it. 

The reasons why students drop out of school, as reported by Radwanski 
(1987), were numerous. Dropouts generally either disliked or were not satis- 
fied with schooling that had not been effective in reaching them. King (1986) 
reported a relationship between student “academic stream” and general 
satisfaction with school; the more academic the stream, the greater the 
satisfaction. A later study by King, Warren, Michalski, and Peart (1988) 
revealed significantly different dropout rates across Ontario’s academic 
streams: academic-level students (university, college, or workbound) made 
up 12%; general-level students (college or workbound) made up 75%; and 
basic-level students (workbound) made up 7% of dropouts (p. 17). Schools 
appeared to differ in their effectiveness in reaching different groups of 
students. 

Discussions on the efficacy of semestered versus traditional scheduling 
took place in 1989 in one large eastern Ontario school board as school 
operations, student retention, and enrollment trends were examined. Board 
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staff were concerned about retaining students; students, however, will only 
remain in school if schooling effectively meets their needs (King, 1986). One 
issue that emerged was: Were dropout rates and school scheduling related? 
If school-related factors were primary factors in students dropping out, and 
if differences in opportunity actually existed between semestered and tradi- 
tionally scheduled schools, differences created by scheduling patterns might 
be related to students dropping out of high school. For example, longer class 
periods but shorter course duration in semestered schools might create pres- 
sures that contribute to students dropping out. Conversely, shorter periods 
but extended course duration in traditional schools might create other pres- 
sures that contribute to dropping out. The question of dropouts and school 
scheduling patterns appeared worthy of investigation. 

Literature on dropouts and scheduling patterns was limited. An ERIC 
search yielded five references on that relationship, two of which dealt with 
changing school schedules as a means, in part, of reducing dropout rates 
(Gifford, 1987; Young, 1986). Another search on dropouts in ONTERIS 
(Ontario Educational Resources Information Service) yielded 18 citations, 
none of which related to school scheduling. Correlates of effective schools— 
which could be related to outcome measures such as dropout rates, grades, 
college entrance rates, and achievement and test scores—were reported by 
Edmonds (1982), Rutter (1983), and Squires et al. (1983). Those correlates 
included degrees of academic emphasis, skills of teachers, teachers’ actions in 
lessons, rewards and punishments, pupil conditions, responsibility and par- 
ticipation, and staff organization (Squires et al., 1983, p. 56). Squires et al., 
in citing Coleman et al., 1966, reported that socioeconomic status and stu- 
dent attitudes showed the strongest relationship to achievement (1983, pp. 
48-49). Relating school effectiveness or conditions to dropout rates—an out- 
come—would be complex. 

Investigating the relationship between scheduling patterns and dropout 
rates seemed worthwhile. Several correlates of effective schools (skills of 
teachers, teachers’ actions in lessons, and staff organization) were likely 
nested in classrooms—to a greater or lesser degree—through a school’s 
scheduling pattern (King et al., 1975; Purkey & Smith, 1982). Do schools, or 
their scheduling patterns, make a difference? 


Purpose 
The study was prompted by (a) the author’s interest in differences between 
semestered and traditional schools (Sharman, 1989) and (b) by the general 
perception that dropout rates were higher in semestered schools than in 
traditional ones. That perception was worthy of empirical study, especially in 
light of findings reported by King et al. (1975): Those who had experienced 
semestering were more supportive of it than opposed to it. 

The study was supported by the provincial office of the Ontario Secondary 
School Teachers’ Federation. The purpose was to determine if dropout rates 
were related to schools being semestered or not. More specifically, the re- 
search problem was: Is there a relationship between scheduling patterns and 
dropout rates controlling for the academic-level of students, and control- 
ling—through matched sampling—for differences in environmental condi- 
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tions? Such information could also be useful to school and board ad- 
ministrators. If a relationship was found, schools and boards would have 
acquired a focus on a potential cause of dropping out, and further study could 
be directed at school characteristics affected by scheduling to isolate reasons 
for students dropping out. Further study would be essential; reliance on the 
findings of one, or a few, correlational study(ies) as a base for any action 
would be inappropriate.? 


Method 

The study was conceived as Design 3 (Campbell & Stanley, 1963) using data 

gathered through unobtrusive measures from archival sources. The limited 

source of data chosen to carry the variables—student records—precluded a 

complex study of effective school characteristics relative to an outcome 

variable. 
Three research questions were asked: 

1. Was there a relationship between school scheduling patterns and dropout 
rates? 

2. Was there a relationship between student academic level and dropout 
rates? The relationship reported by King et al. (1988) concerning dropout 
rates and academic-level was worthy of further study; was the rela- 
tionship replicated, and were there relationships within academic levels 
between school scheduling patterns and dropout rates? In other words, 
did school scheduling patterns affect all academic levels of students the 
same way? 

3. Was there a relationship between school scheduling patterns and dropout 
rates within (controlling for) student academic levels? 

Definitions 

Students were classified by their enrollment status. A dropout was defined in 

this study as any student who leaves school before having obtained his or her 

Secondary School Graduation Diploma or his or her Ontario Secondary 

School Diploma and who either (a) reported he or she was withdrawing to 

employment or (b) the reason for his or her withdrawal was unknown. Thus 

a student known to be moving to another province or country, or transferring 

to another school was not considered a dropout. 

Scheduling pattern was defined as the form of scheduling used in a school. 

In this study, two types of schedule were recognized: 

1. traditional schools employing full-credit full-year organization that typi- 
cally allowed 8 or 9 courses to be taken throughout the year; 

2. semestered schools employing full-credit half-year organization that typi- 
cally allowed 4 (perhaps 5) courses to be taken each half year. 

Other possible scheduling patterns included (a) half-credit, half-year or- 

ganization and (b) trimestered organization, but no Ontario secondary 

schools used such patterns exclusively across all grades (9-13) (Ministry of 

Education, 1987). 

Academic level, which might reflect high school streams, was defined as 
the average level of difficulty at which a grade 10 student took his or her 
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compulsory courses. Those compulsory courses were restricted to four of 
English, mathematics, science, and history or geography. 


Sampling 


A nonprobability (purposive) sample of Ontario public school boards was 
drawn. The criterion for selection was that a board contained both semes- 
tered and traditionally scheduled high schools (Ministry of Education, 1986). 
Nineteen public school boards—of 78 public school boards—met that crite- 
rion; almost all northern Ontario and most rural boards were excluded. 
Seven boards agreed to participate, six declined for logistical reasons (either 
data were not easily gathered or all schools were now semestered), and six 
boards declined for other reasons or did not respond. 

The literature indicated a number of attribute variables were related to 
dropouts. Socioeconomic status, family characteristics, size of school, type of 
program in school, and economic opportunities in the region were rated as 
important factors. Each participating board was asked to provide pairs of 
schools—semestered and non-semestered—matched on location, general 
socioeconomic equivalence of community, size of school, and type of pro- 
grams offered. In that way, the sources of subjects (schools) and the students 
in them would be “equalized” or matched on important plausible rival ex- 
planations that could relate to dropping out. The closeness of match of 
schools and their environments was not directly verified, but the data sources 
contained pairs of schools from the same cities, pairs of comprehensive high 
schools, a pair of rural schools, and pairs of high schools with French Immer- 
sion programs. 

A census of students (data) with entry dates into grade 10 between 
September 1 and November 30, 1986 was taken. That group was chosen 
because they would likely experience the same organizational characteristics 
throughout their schooling, and that group could be followed for three years 
to June 1989—the year in which, under OS:IS (Ontario Schools: Inter- 
mediate and Senior; Ministry of Education, 1983) a circular defining school 
organization) they could first graduate. Those students had had sufficient 
time to change the level of their courses if inappropriate academic levels had 
been chosen in grade 9. Taking that one cohort, rather than analyzing the 
dropout patterns for one year, avoided including students who may have 
“dropped in” to an unfamiliar scheduling pattern late in their schooling. 
Such students could experience adjustment problems in general rather than 
just problems associated with a scheduling pattern. 


Data Collection and Coding 
Data were collected during July, August, and September after all 1989 course 
and enrollment data had been updated to reflect the 1989 year. They were 


gathered unobtrusively through a computer program developed by the author 
and sent to each board for use in its centralized data bank. Two of the boards 


did not have centralized banks and found data were too hard to gather; they 


did not ultimately participate in the study. 
The quality of the information received concerning students undoubtedly 


depended on (a) the quality of information supplied by students on 
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withdrawal from school, (b) the quality of a school’s information gathering 
system, or (c) on the quality of data entry into the system. It would be 
improbable—although not impossible—that a systematic bias in data quality 
would exist between the two groups of schools. Errors associated with data 
would likely be random relative to scheduling pattern. 

Data were gathered and coded for the following variables: (a) board; (b) 
school; (c) student code; (d) school scheduling type; (e) sex of student; (f) 
student enrollment status by the end of [1] second year, [2] third year, and 
[3] fourth year in secondary school; (g) the academic level of a student’s 
grade 10 compulsory courses; and (h) a student’s accumulated credits 
through June 1989. For (f) above, enrollment status was classified as (1) 
returning to school, (2) dropped out, (3) graduated, or (4) transferred. The 
value for enrollment status (f[3] above) ultimately determined the enroll- 
ment status of a student: codes 1, 3, or 4 were recoded as “not a dropout,” 
code 2 remained as “a dropout.” 

Student academic level was determined as follows. Advanced-level cour- 
ses were coded as “1”, general-level courses were coded “3,” and basic-level 
courses were coded “5.” A student mean course score was calculated, and 
then mean course scores between 1.0 and 1.9 were recoded “advanced,” 2.0 
through 4.0 were recoded “general,” and over 4.0 were recoded “basic.” 
Academic level was used as a statistical control variable. 


Results 
Summary Statistics 


Data were received from five school boards and from a total of 18 schools 
within those boards. The sample size for students was N=4,886. Almost 49% 
of the sample came from semestered schools, 51% from traditional schools; 
47.2% of the sample were female, 52.8% were male. The sample comprised 
55.2% advanced-level students, 39.4% general-level students, and 5.5% 
basic-level students. The unit of analysis was the student. 

Table 1 summarizes student enrollment status for the entire student 
sample from September 1986 (grade 10) through to September 1989 (the 
start of year 5). “Mortality” is the number of students who left the sample 
during previous years as transfers, graduates, or dropouts. 

Table 1 shows that 97.3% of the sample remained at the end of the grade 
10 year; 90.2% remained at the end of grade 11; 85.1% remained at the end 
of grade 12, and 74.1% returned to begin grade 13 (year 5). Slightly over 10% 
of the sample had dropped out of school by the beginning of the cohort’s fifth 
year. That percentage differs markedly from the 30 to 33% reported by 
Radwanski (1987), but this study measured dropouts through only four of the 
five years of secondary school that is normal for students in Ontario. 


Testing for Relationships 


The three major variables implicated in the study—school scheduling pat- 
tern, enrollment status, and academic level—were all categorical variables. 
Relationships were analyzed using the chi-square test. Table 2 shows the test 
between scheduling pattern and enrollment status; Table 3 shows the test 
between academic level and enrollment status; and Table 4 shows the test 
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Table 1 


Enrollment Status of Students by Grade and Destination 
enero Sem stars ee el Fk} neg es, Lee ee ee ee 


Grade 10 Grade 11 Grade 12 Begin Grade 13 

Year 2 Year 3 Year 4 Year 5 
ee eee 
Continued 4755 (97.3%) 4405 (90.2%) 4158 (85.1%) 3619 (74.1%) 
Dropped Out 86 (1.8%) 166 (3.4%) 160 (3.3%) 504 (10.3%) 
Transferred 45 (0.9%) 184 (3.8%) 87 (1.8%) 338 (6.9%) 
Graduated 425 (8.7%) 
“Mortality” 131 (2.7%) 481 (9.8%) 
Totals 4886 (100%) 4886 (100%) 4886 (100%) 4886 (100%) 


between scheduling pattern and enrollment status controlling for academic 

level. 

Table 2 shows a statistically significant relationship existed between 
school scheduling pattern and enrollment status. The dropout rate was 
significantly higher for schools with traditional scheduling patterns than for 
schools with semestered scheduling patterns. 

Table 3 shows that a statistically significant relationship existed between 
student academic level and enrollment status. The difference occurred main- 
ly between advanced-level students and the other two groups; there was little 
difference between general-level and basic-level student dropout rates. 

When the relationship between school scheduling and dropouts was 
tested controlling for the effects of academic level, as shown in Table 4, the 
following interaction effect emerged: 

1. There was no statistically significant relationship between school schedul- 
ing pattern and enrollment status for advanced-level students. That is, 
there was no apparent effect on dropout rates attributable to scheduling 
patterns among this group of students. The advanced-level dropout rate 
in semestered schools was slightly higher than the advanced-level dropout 
rate in traditional schools. 


Table 2 
Enrollment Status by School Scheduling Pattern 


Enrollment Status 
Scheduling Pattern Not a Dropout Dropout 
a ATER LETS. a ENS Nes 
Semestered Pattern 2161 (91.0%) 214 (9.0%) 2375 (100%) 
Traditional Pattern 2221 (88.5%) 290 (11.5%) 2511 (100%) 
Totals 4382 (89.7%) 504 (10.3%) 4886 (100%) 


Chi-Square = 8.503 (df= 1, p = 0.0035) 
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Table 3 
Enrollment Status by Academic Level of Students 
Enrollment Status 

Academic Level Not a Dropout Dropout 
Advanced-Level 2546 (96.0%) 106 (4.0%) 
General-Level 1589 (84.0%) 302 (16.0%) 
Basic-Level 223 (85.1%) 39 (14.9%) 
Totals 4358 (90.7%) 447 (9.3%) 

Table 4 


Enrollment Status by Scheduling Pattern by Academic Level 


Academic Level 


Advanced-Level General-Level Basic-Level 
Scheduling Not a Not a Not a 
Pattern Dropout Dropout Dropout Dropout Dropout Dropout 
Semestered 1274 60 760 124 118 af 
(95.5%) (4.5%) (86.0%) (14.0%) (91.4%) (8.6%) 
Traditional 1276 46 829 178 105 28 
(96.5%) (3.5%) (82.3%) (17.7%) (78.9%) (21.1%) 
Total 2546 106 1589 302 223 39 
(96.0%) (4.00%) (84.0%) (16.0%) (85.1%) (14.9%) 
Total 2652 1891 262 
Chi-Square = 1.754 Chi-Square = 4.671 Chi-Square = 8.109 
(n.s.) p=0.031 p=0.004 
2. There was a statistically significant relationship between school schedul- 
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ing patterns and enrollment status for general-level students. That is, 
there might be an effect on dropout rates attributable to scheduling 
patterns among general-level students. The general-level dropout rate in 
semestered schools was lower than the general-level dropout rate in 
traditional schools. 

There was a statistically significant relationship between school schedul- 
ing patterns and enrollment status for basic-level students. That is, there 
might be an effect on dropout rates attributable to scheduling patterns 
among that group of students. The basic-level dropout rate in semestered 


schools was much lower than the basic-level dropout rate in traditional 
schools. 


Student Dropouts and Scheduling Patterns 


Table 5 focuses on the dropouts exclusively. That table shows the percent- 
age of students originally enrolled in each academic level in each scheduling 
pattern who dropped out of school. Insert Table 5 Here 

Table 5 shows clearly, within academic levels, that the dropout rate 
(percentage) for advanced-level students in semestered schools did not differ 
markedly from the rate for advanced-level students in traditional schools. 
The dropout rate among general-level students in semestered schools was 
markedly lower than the rate among general-level students in traditional 
schools. The most marked difference in dropout rates occurred within basic- 
level students; the dropout rate for semestered schools was far lower than the 
rate for traditional schools. 


Post-Hoc Analysis of the Data 


Two other relationships were examined. First, the relationship between sex 
and dropout rates was tested to determine if dropout rates—from the entire 
sample—differed between the sexes. There was a significant difference in 
dropout rates between males and females (Chi-Square=12.134; df=1; 
p=0.0005; N=4886). Controlling for academic level of students, the only 
significant difference occurred between general-level males and females 
(Chi-Square=5.725; df=1; p=0.0167; N=1891). Male dropouts outnum- 
bered females at both other academic levels, but not significantly. In total, 
303 (60%) of the 504 dropouts were male, a finding that replicates the rate 
reported by King et al. (1988). Expressed differently, 11.7% of total number 
of males dropped out while only 8.7% of females dropped out. 

Second, the relationship between school scheduling pattern and student 
credit deficit—the number of credits a student was below some norm—was 
tested. The notion of credit deficit was a major discussion point in Student 
Retention and Transition Project workshops run by Ontario Ministry of 
Education officials during the 1988-1989 school year. Credit deficit, or 
“credits behind” as named by M.J. Peart (personal communication, February 
22, 1989) could serve as a valid predictor concerning dropouts. The notion 
and the relationship, while not a primary purpose of the study, seemed 
worthy of investigation. The norm established for this test was 30 credits (the 


Table 5 
Dropouts by Scheduling Pattern and Academic Level 
(Number and Percentage of Original Enrollment) 


Academic Level 


Scheduling Pattern Advanced General Basic 
ee = eee LO ee EE eee 
Semestered 60 (56%) 124 (41%) 11 (28%) 
Traditional 46 (44%) 178 (59%) 28 (72%) 
ee ee ee eS eee 
Totals 106 (100%) 302 (100%) 39 (100%) 


Chi-Square = 11.839(df = 2, p < .0027) 
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criterion for graduation under OS:IS). The credit accumulation of a student 
was categorized within ranges above or below the norm. Table 6 summarizes 
the data associated with that test. 

Table 6 reveals a significant difference between scheduling patterns and 
credit deficits. It shows that the greater percentage of the students with 
credits accumulated above the norm (more than 30 credits accumulated) 
occurred in traditional schools. In each category of credit deficit ranging 
between 2-19 credits below the norm (13-28 credits accumulated) the greater 
percentage of students with those deficits occurred in semestered schools. At 
the high end (13 credits accumulated) and the low end (29-30 credits ac- 
cumulated) of the credit deficit scale, the greater percentage of students with 
those deficits occurred in traditional schools. The relationship of credit deficit 
(or accumulation) to scheduling pattern appears complex but, in general, the 
greater percentage of deficiency in each category occurred in semestered 
situations. Students in traditional scheduling patterns appear to accumulate 
credits more rapidly than in semestered situations. 


Discussion 

Undertaking this study was, to a large extent, like shutting the stable door 
after 75% of the horses had bolted. While the study simply establishes that a 
relationship did exist, it may provide a focus or stimulus to study more deeply 
the characteristics of teaching and learning in different school organizations. 
Replicating the study at the end of a cohort’s fifth year in high school might 
also allow further verification of dropout rates. It is possible, however, that 
the opportunity to replicate this study might not exist for long, in Ontario, if 
the shift to semestered scheduling patterns continues at its present rate. 
Perhaps better, some studies of causal relationships between effective in- 
struction, such as reported by Rosenshine (1983), and dropping out could be 
conducted within or between school types to yield findings that might con- 
tribute to making schools more effective. 

The study did establish the wealth of objective archival data available to 
researchers, and also the willingness of school boards to share those data 
where practical. There is no easy way of certifying the reliability or validity of 


Table 6 
School Scheduling Pattern by Credit Deficit 
Credit Deficit 
30-25 24-20 19-15 14-10 9-5 4-2 1-0 >Norm 
Semestered 30 95 194 238 361 433 653 353 2a5r 
40% 45% 53% 52% 58% 56% 44% 43% 48.8% 
Traditional 44 114 172 224 265 347 834 469 2469 


60% 55% 47% 48% 42% 44% 56% oF dy te aE 


Totals 74 209 366 462 626 780 1487 822 4826 
1.5% 4.3% 7.6% 9.6% 130% 16.2% 30.8% 17.0% 100% 


Chi-Square = 66.165, df=7, p<0.0001 
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the data, but they are likely no less reliable, in totality, than data collected 
through any other means. A major limitation of archival research, though, is 
that no additional information can be gained concerning causes or other 
factors not contained in each record. 

The post-hoc finding concerning credit deficit and school scheduling pat- 
tern (Table 6) was intriguing. Explaining that relationship in terms of the 
supposed benefits or opportunities afforded by semestering might prove 
difficult. Sharman (1989) found that grade 9 students in a semestered sys- 
tem were less satisfied (because of pressure) than grade 9 students in a 
traditional system, but in grade 10 semestered student satisfaction had 
increased. Perhaps grade 9 students experienced enough difficulty adjusting 
to semestering, a system that is markedly different from their middle school 
experience, to cause an initial difference in credit accumulation. The overall 
difference in credit accumulation might occur in grade 9. That could also 
provide a research topic of interest. 

This study did not set out to measure school effectiveness per se, which is 
a very complex phenomenon (see Bryk & Thum, 1989). If dropout rates, 
however, form a valid outcome measure of school effectiveness, it might 
appear, superficially, that one form of scheduling pattern is more effective 
than another for certain types of students. A statistically significant rela- 
tionship was found to exist between school scheduling patterns and dropout 
rates, and the study, therefore, achieved its purpose. Whether that rela- 
tionship is deemed to be practically significant rests with each reader, al- 
though readers should always consider that what goes on in classrooms 
might be determined, in part, by the structures in which those classrooms are 
nested. Classrooms as organizational subunits are not likely the “closed 
systems” that writers such as Lortie (1975) might lead one to believe. 


Notes 


1. This study was supported by the Provincial Office of the Ontario Secondary School 
Teachers’ Federation through a J. W. Ansley Memorial Award for research. 

2. The author gratefully acknowledges the invaluable support and assistance of the 
Directors of Education and Computer Services personnel of the Board of Education for 
the City of Windsor, the Carleton Board of Education, the Durham Board of Education, 
the Northumberland and Newcastle Board of Education, and the Waterloo County Board 
of Education. 

3. This study will be replicated using the same schools but implicating students who entered 
grade 10 in 1987. 
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A traditional classroom teacher led a group of children in an informal afterschool 
drama program. His growth in the use of drama teaching strategies and his 
understanding of the role of drama in empowering children to create are the focus 
of this study. The themes that emerged from the study were growth in the ability 
to develop story schema through drama experience, the use of drama teaching 
strategies, the use of imagery as a teaching strategy in the drama experience, and 
the development of language through use in a meaningful and purposeful context. 
Implications drawn from the teacher’s journal, researchers’ field notes, and inter- 
views led the researchers to make connections between informal drama sessions 
and traditional drama teaching situations. 


Introduction 
Because little research has been completed that explores the value and 
effects of drama education in informal nonschool settings, the researchers in 
this study elected to follow a group of children in an afterschool informal 
drama program to investigate the themes and issues that emerged from the 
weekly drama classes. The new understandings of the teacher, which devel- 
oped over the course of this project, are the focal point of this article. 


Purpose of the Study 

The research project was specifically designed to provide direction for im- 
proving the use of drama as a tool for effective teaching. It was hoped that 
this would enable teachers, whether involved in formal classroom instruction 
or in recreational drama activities, to more readily perceive the value of 
drama as a way of knowing and enable them to be more comfortable in using 
drama as a teaching tool. To achieve this, the cognitive, affective, and artistic 
growth of children as they participated in a weekly afterschool drama pro- 
gram was investigated over a two-year period. 

This article explores only one aspect of the total two-year project, the 
growth of Sean, the teacher, as he experimented with drama as a teaching 
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tool, using one class of nine- to twelve-year-olds in the afterschool drama 
program. By exploring with Sean his new awarenesses of drama techniques, 
the researchers hoped to highlight new understandings that will benefit other 
teachers. 


Collaborative Component of the Study 
According to Swain (De Bevoise, 1986) working with the schools is a natural 
outgrowth of the university’s mission. “As an urban university, we have a 
self-conscious dedication to serving the surrounding urban area, to develop- 
ing and enhancing our linkages with the community” (p. 10). 

With this concept in mind, the Departments of Drama, Secondary Educa- 
tion, and Elementary Education at the University of Alberta collaborated to 
form a University/Community Children’s Drama Project to foster the devel- 
opment and support of child drama and drama education. In order to do this 
a new collaboration was established with a classroom teacher from Edmon- 
ton Public Schools and with a member of the Windsor Park Community 
League. A class of children, drawn from the university neighborhood, met for 
one hour per week for two years (48 one-hour sessions covering four univer- 
sity terms). 

All three components, university, school, and community, collaborated on 
creating the drama project, planning the specific classes, carrying out the 
practical aspects of the program’s organization, and evaluating the program. 

Two university members of the project initiated a research component to 
explore the educational drama implications of these meetings. As the re- 
search developed it became clear to the researchers that it would be neces- 
sary to redefine the use of the term collaboration as it pertained to this 
project. At its inception the project used the following definition: “collabora- 
tion is a term that implies the parties involved share responsibility and 
authority for basic policy decision making” (Hord, 1986). The researchers 
came to realize that the above definition pertained to the Drama Project 
itself, but the research work seemed to demand an evolving definition. 
Carson and Jacknicke (1988) elaborate the difference between collaborative 
and cooperative projects: cooperation meaning to work together and col- 
laborative meaning to work in conjunction. As further description demon- 
strates, the research component with Sean, the teacher, began as a 
cooperative project and evolved into a collaborative endeavor. 


Setting of the Study 

The class met in a community league building and used an upper storey, 
large, carpeted room as the drama space. Windows framed one side of the 
room so that both natural light and artificial light could be used. Sean, an 
elementary school teacher with nine years of elementary school teaching 
experience, led the drama. The children who participated in the first year had 
an age range from seven to eleven years. During the second year, two classes 
were organized: seven- and eight-year-olds, and nine- to twelve-year-olds. 
The data presented in this article focus on one of these second-year classes 
only: the eight children aged nine to twelve years. 
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Methodology Used in the Study 

The most important piece of research data was the weekly journal kept by 
Sean containing his reflections on the class he had just led (Spradley, 1979; 
Craig, 1984a). In the second year, the researchers shared with Sean the 
themes that had emerged from the first year’s data in order to focus his 
observations on these aspects. However, he was still given the freedom to 
reflect on other areas of concern emanating from the class periods (Rowles, 
1978; Agar, 1980). 

A second source of data consisted of field notes taken by an in situ 
recorder (research assistant trained in ethnographic research methodology, 
Spradley, 1979). This assistant observed each class and collected factual 
material describing the teacher’s strategies, lesson sequencing, questioning 
modes, and student responses and involvement. 

Third, the two researchers conducted informal interviews (Spradley, 
1979) during the four terms, at the six-week and 12-week points. These 
interviews were conducted with six selected parents of the children and six 
selected children. The interviews were audiorecorded and transcribed. 

Last, the researchers interviewed Sean (Spradley, 1979) at four-week 
intervals over the four terms. These interviews took the form of informal 
dialogues, often emanating from questions or statements made in Sean’s 
journal. These also were audiorecorded and transcribed. 

These triangulated data were collected and, as themes emerged through 
repetition in each of the data bases, material was coded according to these 
emergent themes (Glaser & Strauss, 1967). A schema was drawn up indicat- 
ing the connections from the data both to drama teaching strategies and to 
language arts (the fields of expertise of the two researchers). The transcript 
of these emerging themes was shared with Sean so that his reactions to these 
perceptions could be used to verify the findings. Latitude was allowed during 
the second year for any new themes that might emerge as the acts of coding 
and interpretation continued (Werner & Rothe, 1979). 

In the first term of the project, the researchers identified the themes and, 
in interview, shared these findings with Sean. This was a cooperative step. As 
Sean began to articulate the findings from his journal the mode became more 
collaborative. The teacher and the researchers together established the focus 
for further data collection. 

Because this project fits under the heading of “Human Science Research” 
(van Manen, 1990) and because human beings were the subjects of the study 
and their words and actions were to be used, it was important that the 
research meet the five guidelines of the Research Ethics Committees of both 
the Department of Elementary Education and the Department of Secondary 
Education at the University of Alberta. Both committees gave their approval 
to this project (University of Alberta, 1985). 


Emergent Themes 
The specific themes identified by the researchers and verified by Sean during 
the course of this study were: 
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- Sean’s growth in the ability to identify story possibilities and of the child- 
ren to develop stories through drama experience. 

- Sean’s expanded awareness and use of drama teaching strategies. 

- Sean’s development of imagery as a part of the drama experience. 


- Sean’s role in the children’s language development through its use in a 
purposeful context. 


Theme One: Story Development 


Story is a basic way of organizing human experiences, a framework for 
learning.... Drama helps children journey inside the story garden, so that they 
can reconstruct the symbols, images, and narrative sequence. (Booth, 1985, 
pp. 194-195) 


In looking at the development of the story and the origin of the story 
ideas, a number of points emerged in several data sources. Sean was con- 
cerned not to impose his ideas onto the children. He wanted to accept their 
ideas and interests and work with them. It was important to Sean that the 
children work with a theme and a story that was meaningful and interesting 
to them. In his first interview of the second year of the study Sean said: 


I’m not dictating curriculum, like I am at school, so the situation is different, 
and I didn’t want to come in dictating curriculum, but I, as well, didn’t want 
to get caught in the kind of trap I got into last time. 


Sean was aware of the problem he had encountered in the first year’s 
teaching at the drama center—that of pursuing a theme or story line with the 
children which he thought they would want and resisting what the children 
were themselves suggesting. Sean therefore made a strong commitment to 
listen to the children’s preferences and to try to work with them. However, in 
the second or third session of year two he made an interpretation of the 
children’s responses that took the drama in a direction that was not comfort- 
able, or meaningful, to all the children in the group. Sean felt a strong sense 
of purpose and direction and forged ahead with a “space” theme. He designed 
a space station and the children were to react to it. Although he was con- 
cerned about building group identity, his theme only served to split the group 
into two distinct factions, those who wanted to go to space and those who 
rejected this idea for the story line. Sean initially resisted the suggestions of 
a group of children who opposed the theme but soon realized that the drama 
could not work in those circumstances. He reluctantly had to abandon the 
space theme, going back to the theme of people in conflict and the problems 
of “Life in Chocapple Street,” which had been the theme developed by the 
children at the beginning of the term. 
Sean wrote in his journal: 


Ihave let go of the going to space, but hang on to our city of devastation. I was 


not going to fight them on the space theme. I had been fighting them, I think 
now. 


sad Sean also realized that some children had been absent during some of the 
initial group experiences. When these children returned to class, they were 
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reluctant to accept decisions that had been made in their absence and to 
participate in the story. The absence or presence of one or two children each 
week had a strong influence on how the story developed and what decisions 
were made. Also the background and interests of each child made an enor- 
mous difference to what happened to the story and the way the children 
approached it. 

Sean made a number of observations: 

1. The value of shared experiences in a group drama cannot be underes- 
timated in regard to building level of belief and a meaningful story. 
2. The effect of unstable attendance at the center was a cause of discon- 
tinuity and fractured the group. 
3. Personal differences among the children regarding interests and ways of 
working must be taken into account. 
4, The children worked most effectively in the “now,” and when they were in 
action. 
This latter point was highlighted in the middle of the term, just before 
Halloween, when Sean took the emphasis off problem drama and did a “game 
type” of session, where the children created a monster. The children enjoyed 
it immensely and worked together well as a group. Sean observed that the 
children worked together well when they were working in the “now,” and he 
commented that “they had trouble identifying with those pictures ... in the 
past. It was all discussion. It needs that flavor of action.” (He had taken in 
pictures in year one as a way of getting the drama started.) He further said 
that it was quite clear that the children still had some interest in what they 
were doing (even though there was dissent in the group). Sean speculated 
that it was possibly just the thrill of being together and drawing a picture 
together, but from these observations it appeared that the children did not 
want to plan and discuss, or go back into the past, but wanted to get into 
action and do something. 

By the final term of the project Sean had developed many insights into his 
work at the drama center. He realized that he had been controlling ideas and 
activities to a much greater degree than he had imagined. This giving up of 
control to the children was a major step for Sean in his work that year. Sean 
felt that he was able to go into the last term more open to the children’s ideas, 
yet ready to provide stimulation and facilitate the children’s work in an 
open-ended way. As a result of this, according to Sean, the children developed 
a high level of engagement in the story and their role in it. Therefore, the 
work became more focused. He felt there was little conflict and that there 
developed a solid group cohesiveness. He also said, 

Last term I had trouble getting a direction going... This term I went in 


differently. I went in without my pictures under my arm. I went in with a half 
statement which was “Standing on the edge....” I went in with just an idea. 


(Interview) 


Sean ensured that everything he structured for the children consisted of 
common shared experiences, but many of the ideas, such as the time travel- 
ing machine used in the last term, came from the children’s individual 
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drawings. After a reading from the first part of a book called The Laughing 
Lion (Pearson, 1921) the children were asked where their time traveling 
machine took them. Their responses varied from San Francisco turned into a 
Fantasyland, to dinosaurs, to nuclear war, and Sean’s aim was to explore all 
of these through drama. 


What seems to be clearer to me this term as opposed to last term, is definitely 
the idea of the story—it’s stronger. There’s different personalities there, and 
time travel doesn’t mean space. (Interview) 


The changes in Sean’s approach to the drama sessions, according to his 
analysis, were enormous. Many of the children’s ideas were incorporated into 
the sessions, and this resulted in less stereotyped dramatic responses.“To 
some extent, [teachers] must be prepared to put themselves in their pupils’ 
hands while retaining the functions, duties and responsibilities of the teach- 
er” (O’Neill & Lambert, 1982, p. 21). This was happening for Sean. He was 
beginning to recognize the “journey inside the story garden.” 


Theme Two: The Use of Drama Teaching Strategies 


Teachers are becoming aware of the elements which drama and theater 
share—elements which have long been recognized as the essential “clay” of 
the medium by the most effective theorists and practioners. (O’Neill, 1985, p. 
158) 


The tools that form the basis for the craft of theater have been classified in 
various ways. For the purpose of this study we used the classification sug- 
gested by Dorothy Heathcote (Wagner, 1976 p. 154), a noted drama educator. 
To prove that classroom drama uses the elements of the art of theater she 
proposed six basic theatrical tools as the spectra used in the development of 
theater and adapted by the classroom teacher who leads students in the art 
of drama. These are darkness/light, silence/sound, and stillness/movement. 
As the classroom strategies used in this study were analyzed it became clear 
that all of these elements were part of the drama experience. 


Darkness/Light 


The physical aspects of the room did not of themselves lead to an easy 
application of this theatrical principle. There was only general illumination. 
No spotlights or dimming devices were available. The windows along one side 
of the long room made for bright illumination as part of the general environ- 
ment. Sean used the command, “Close your eyes” frequently. Sometimes the 
command preceded an imaging exercise. At other times, the darkening of the 
space by the closing of the eyes preceded side-coaching to build environment 
or mood, or to create a bodily reaction. 

At other times, darkness was created by a blindfold so that one person 
could not see in order to carry out a theater game to build tension. An 
example would be the adaptation of the Keeper of the Keys game which Sean 
used to build supersonic ears for the space station. 
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Silence/Sound 


The second set of principles which form the basis of theater were more 
possible in the space in which the class was held. From the first meeting when 
certain Name Games were introduced, sound was part of these sessions. The 
children were urged to be articulate, to respond, to discuss, to brainstorm for 
ideas. Speech drills, breathing drills, articulation exercises, and patter drills 
were all part of the warm-up exercises. 

Storytelling became part of the session as Sean told stories to instigate 
and evoke responses. The circle story was introduced where the children 
collectively created a story. Sound stories were used, as were machine exer- 
cises where sound was added. 

In contrast, periods of quiet, of silence, were also part of the training. 
Frequently, the side-coaching included, “Concentrate,” “Think,” “Be still,” 
“Decide for yourself,” “Turn down the volume,” “Listen,” “Make your plan 
in your head,” “You’re in a dark, quiet place” (Field Notes). 

The balance between sound and silence formed the basis for this class. 


Stillness/Movement 


From the first day of class, movement or complete involvement of the body 
was part of the class activity. The movement evolved from the exercise 
variety, as Sean directed: 


Take fast tiny steps. 

You are the smallest shape, the biggest shape. 

In a glass box, gravity is a problem. You are weightless. 

We are the fountain at West Edmonton Mall. (Field Notes) 


to the more complex movement directions: 
We’re going to make a machine with 11 parts. (Field Notes) 
to movement-in-role: 


Line up everyone! I’m your sergeant. You’re going to enter weightlessness. 
Have any of you been there before? (Some nod.) Then you know you have to 
puff up your cheeks and learn the techniques for propelling yourself. (Field 
Notes) 


Stillness was encouraged from the first class with the use of the word 
“freeze.” The use of time-count encouraged stillness and then fast move- 


ment, for example, 
Meet me on the count of 5. (Field Notes) 


The technique of tableaux was another use of stillness which led to under- 
standing and meaning. A frozen picture, created by a single child or a group 
of children working together, was utilized from the second class onward, for 


example, 


Make a frozen picture of your hands working your machines. (Field Notes) 


All six of the elements which form the basis of the theatrical experience 
were present in the drama class, and yet other teaching techniques were 
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utilized which defy classification into one of these categories. These are 
discussed in the following drama strategies which emerged as part of this 
theme. 


Boundaries 
Important to these classes was the sense of restrictions placed on the stu- 
dents for the general order and organization of the class. It was found that 
space boundaries (the circle, the use of a rug) and certain sound controls (a 
cymbal, a tambourine, counting off, touching the shoulder, a clap, the word 
“freeze”) established class rules and set a sense of security about the 
work/play. Time boundaries were also present, as Sean side-coached: 


Meet me at the count of 5. 
When I count to 5 things will be worse. (Field Notes) 


Boundaries worked so well that, in time, only one command, “Circle,” was 
followed by the children instantly which saved time and allowed the short 
class to be utilized to its best advantage (Tarlington & Verriour, 1983, p.14). 


Depictions 
Another strategy used frequently in this program was that of depiction. It 
was used to open or close a session. For example, Sean brought in pictures of 
the past for the students and he reflected: 


I ... laid out a sequence of 7-8 pictures of city scenes. Actually the same 
location that over a series of years went through major modernizing change. 
(Journal) 


The children were asked to study the pictures and write down three things 
which impressed or interested them in the picture or pictures that they chose. 
Each week, the class began with the students working on their picture or 
word list from the week before, or by decorating their folders. Some days the 
pictures were used as part of the drama. At other times, this device seemed to 
be used as a bridge or transition to bring the class back to the theme being 
developed after a week’s distance from the story line. 


Use of Role 

The techniques of endowing students with a role and of the teacher stepping 
into role in order to further the action of the drama were used frequently. 
Sean asked the class to take on the mind of the inventor, to create machines 
for the future. They were made experts on internal design in space. At 
another time they were space center captains and new recruits in a paired 
exercise that moved into role reversal so that each participant would experi- 
ence both roles. 

The teacher took on several roles, some clear-cut and obvious, others in 
the grey area which allowed students to determine the role confronting them. 
“I’m your sergeant” is a clear exposition of the role being donned. “Everyone 
line up,” followed by a dropping of role to say, “Who do you think was 


ere to you?” puts the control of the role into the students’ hands (Field 
otes). 
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Another development in role usage was asking students to quickly leave 
one role and don another. 


Shake that role off. You are now a group of protesters. (Field Notes) 


Later in the term role reversal was at the heart of the drama experience 
where the class explored both the role of the people in a kingdom they had 
created, and the people of the valley near the kingdom. Roles were used to 
make a real life connection from one drama experience to the other. 

Sean had utilized the six elements of the art of theater and had also 
recognized as part of “the essential clay of the medium” other teaching 
strategies such as boundaries, depictions, and role play. 


Theme Three: The Use of Imaging as a Teaching Strategy 
in the Drama Experience 


Drama enables us to discover the heart of story through its images. (Haine, 
1985, p. 187) 


Imaging, the closing of the eyes to see on the white screen of the mind, 
appeared to be an important strategy, one to be explored independent of, and 
yet integrated with, other drama teaching strategies. It was used by the 
teacher as a method of helping the children to believe more firmly in the 
drama being explored. However, it was also extended to other uses. 

In one drama experience the use of imaging became a factor in the growth 
of belief in the drama moment. Early in the term Sean was leading a 
warm-up exercise involving imaginary balloons. His side-coaching included 
the words, 


In drama, we have to imagine things that aren’t there and believe them ... 
watch the balloon, what color is your balloon? (Field Notes) 


He worked to understand the children’s responses to the warm-up exercise, 
to sort out the signals they were making to his side-coached directions: 


There was no common image of the machine. They each had an image. Some 
had no image, just a manufactured movement bit, but involvement, nonethe- 
less. (Journal) 


The use of this strategy became clearer as the children developed the ritual of 
the building of the space craft, which was used frequently. As Sean attempted 
to deepen their belief in the craft, the side-coaching opened avenues for 
imagination development. 

Imaging was also used as a method of organizing, of planning ahead, of 
sorting out ideas, and of gathering evidence. At one point Sean commanded: 


Freeze. Close your eyes. Make your plan in your head. Think of evidence... 
Sit still. Close your eyes. Get a picture of something you see. People are being 
hurt. There’s not much time. I want us to know Chocapple Street ... 

In your own head, make up your mind what you'll say. (Field Notes) 


It was also used as a method of control, of calming down, and focusing on 
the work at hand with seriousness and purpose. He side-coached: 
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Sit in a still position. Close your eyes everybody. Take all the silliness out of 
your mind. You’ll need your seriousness. (Field Notes) 


Perhaps the most novel use of imaging, and this may be one of the 
breakthroughs of this study, was the use of artwork in the class as a way of 
visualizing in which directions the drama might move. The pictures, child- 
conceived and drawn, became the next step in the drama sequence and were 
combined with imaging exercises in the drama experience. 

Sean realized that five of the children’s drawings were depictions of 
action on which future parts of the drama could be built. One boy had 
depicted two dinosaurs in battle just over the ridge on which the time 
machine had settled. Sean said: 


(Paul) probably twigged in really early that I was using ideas off his page. But 
to the rest, it wasn’t that at all. To somebody else it was “well, I’ve got this 
sense that we built this orchestrated monster fight over the hill that we 
couldn’t see”.... And I’ve got five more visions on paper that are vastly 
different ... The whole lesson yesterday was structured around his (Paul’s) 
picture—a lot of imagery, a lot of parallel play ... in being in this heavily jungle 
area. (Interview) 


Haine (1985) insists that drama is “image work” (a phrase he borrows 
from Hillman’s writing, 1983). By this he means that “participation in 
drama draws on both conscious and unconscious imaginative capabilities and 
involves teacher and student in the bedrock of human reaction” (p. 187). In 
this study, Sean used imaging strategies both in the drama per se and in the 
artwork adjacent to the story-building. He seemed to view drama and par- 
ticipation in it as image work. However, he exercised the precautions that 
have been suggested for all drama teachers who use imaging as a teaching 
strategy (Craig, 1984b, p. 14). These precautions are that the side-coaching 
be done carefully, seeding the image the leader wishes the children to have or 
to develop in the mental exercise. The questioning should be done specifical- 
ly, with a purpose for the words used. Opportunities should be offered for the 
children to share verbally or in artwork the images that they have experi- 
enced. Timing of the imaging exercises should be short. Last, before and after 
the imaging exercises, action sequences should be structured into the lesson. 

Sean had discovered imaging as one way to “the heart of the story.” 


Theme Four: Language Development 


If language development is to be the central focus of a classroom drama, then 
reflection about language should also play a major role in the planning and 
implementation of a dramatic context. (Verriour, 1985, p. 182) 


The language of the drama sessions appears to have followed some dis- 
tinct patterns. At the very beginning of the first interview held with Sean, he 
asked the question, as much to himself as to anyone, “Is language use a 
teaching strategy?” He answered his own question by saying, “I think so—it 
is as integral as using a tambourine as a control device; extending the 
language that they give me all of the time.” Here, Sean was referring to 
Heathcote’s idea of “upgrading” children’s language (Wagner, 1976). Sean 
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said that Heathcote’s ideas and words rang in his ears as he led each session. 
He had studied with Heathcote, observed her work with young people, and 
dialogued with her as a teacher (Interview). This was certainly apparent in 
Sean’s interaction with the children during the drama sessions as he showed 
an effort to upgrade children’s language. During one class, for example, these 
exchanges took place between Sean and the children: 


Ann: It might be possible to go underwater. 

Sean: Ah, you’re suggesting another environment. 

Greg: I heard yesterday that other countries are also having problems. 
Sean: Ah, the problem is global. (Field Notes) 


He also assisted them in clarifying their thoughts. In one interview Sean said: 


So that’s a goal too, so that they can talk about and be expressive with these 
problems ... whether they’re role playing or reflecting. 


In his journal, Sean wrote that the “ages are closer” (than last year), “we 
can rely more on the written word and expand from the crayon drawn 
picture.” “The older group allows one to say ‘write’ instead of ‘draw’; another 
dimension I shall capitalize on throughout.” In the field notes for the classes 
it is reported that Sean did ask the children to write as a way of reflecting on 
their experiences in the drama, or as a way of clarifying their thoughts. The 
children were asked to write down in their folders the things that had 
impressed or interested them, and Sean reminded the children that this was 
a “thought exercise.” He asked them to write in sentences and not just in 
single words. This language-thinking connection was apparent throughout 
Sean’s work in the drama sessions. The children were often asked to list, for 
example, things that had changed in the last 100 years, and they were often 
asked to brainstorm ideas that were then recorded on chart paper, for 
example, imagining what kinds of adventures they could have in space. 

Although the children were generally described by Sean as being articu- 
late, he expressed some concern over the level of oral language interaction 
that occurred during some of the drama sessions. At one point (in an inter- 
view) Sean commented that the articulation and language level was a “bit 
static now.” He said that he thought the challenge of the idea had gone and 
that he had “dragged it on too much.” He thought there seemed to be too 
much repetition. In a later session the children did not respond well to the 
questions put to them, perhaps because there seemed to be a “right answer” 
being sought. The children did not appear willing to risk. The field notes state 
that the children did not seem to be interested. Their responses to questions 
were without life. They were not serious and didn’t seem to be involved. Sean 
was aware that at some points he was pushing an idea that the children were 
not fully willing to receive. He felt the tension, and the children’s unwilling- 
ness was reflected in the ensuing discussion or lack of discussion. He also had 
the awareness to encourage the children to voice opposition and to make 
their feelings known. 

The overriding message that emerges from these interactions is that 
children must have something real and urgent to talk about if they are going 
to engage in any meaningful and intense discussion. Children do not talk for 
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the sake of talking. They must feel ownership, they must feel they are being 
heard, and they must have a sense of purpose in their discussions and 
dialogues. If children sense that the situation is artificial and there is no real 
purpose in what they are doing, they are not likely to engage in meaningful 
interaction. 

The key words that emerge throughout this study are “challenge” and 
“interest.” It appears that if the children were challenged (rather than 
pushed) with issues, material, and situations that were of interest to them, 
the language that was generated was at a far higher level of intensity and the 
children were much more engaged in the drama and the discussion. It is this 
level of engagement that appears to be at the heart of the language use. Sean 
reflected on the language development “in the planning and implementa- 
tion” of the drama sessions. 


Reflections on the Growth of the Teacher 
Sean used his journal to write some final reflections as he concluded a term 
of leading the children at the Community League. He wrote: 


It has been a great learning experience for me as a teacher. I’m looking back 
and beginning to be able to step back and away from the work and gain an 
objective analysis.... I’m ready to continue along this road to growth and 
discovery to make more natural my work with young people. (Journal) 


As we looked back over the themes we investigated, we found that al- 
though each theme could be analyzed separately there were underlying con- 
nections. These connections opened the way to dialogue between the 
researchers from which new understandings emerged: understandings of the 
teachers, of the children, and of the techniques and strategies that make 
drama such a valuable teaching tool. Sean’s journal reflections and the 
teaching strategies he explored during the drama sessions led the way to this 
connection-making. 


Empowerment of Children 


Sean came to realize that drama teachers may have to “put themselves in 
their pupils hands while retaining the functions, duties and responsibilities of 
the teacher” (O’Neill & Lambert, 1982, p. 21). “The teacher must be 
prepared to enter the children’s world once they begin to work in the 
medium” (Heathcote, 1984, p. 47). Early in the project Sean was committed 
to avoiding the dictation of a curriculum to the children and yet he saw that 
he had fallen into this trap. Later, Sean again made a strong attempt to avoid 
this pitfall. During the latter months, however, Sean reported that he was 
aware of this error and realized that he was able to stimulate the children’s 
ideas more thoughtfully and use these ideas in the drama sessions. He felt he 
was able to take that great and risky step of passing control to the children, 
and the children’s responses were equal to the task. Sean had managed to 
enter the children’s world and to put himself “in his pupil’s hands.” 

Sean also realized that the more control or input the children have with 
the ideas of the drama the more engagement there is and the deeper their 
level of belief in the story. This also applies to the language used. When the 


348 


Collaborative Drama Research 


children were put in situations where they needed to use language to meet 
their own needs in the drama situation, then the language was richer and 
more fluent. When responding to teacher statements or teacher questions the 
children’s language was more perfunctory and far less complex. Sean focused 
on the connection between language and thinking throughout the drama 
sessions and clearly saw the connection between ownership of ideas, the 
thinking demanded by the situation, and the overall language used. 


Use of Drama Teaching Strategies 


Sean’s key breakthrough in the early days of leading the class was in the 
importance of creative dramatics: the importance of the body, movement, 
and of all the things that are part of structured dramatic play, and also part 
of mime and movement. The importance of this insight affected his later 
work. Sean saw this not as games and movement versus problem solving, or 
as American creative dramatics versus British child drama, but as different 
parts of a whole. The one needed the other; creative dramatics was a way of 
creating greater belief in the drama, leading to understanding and a deeper 
level of drama participation. By exploring the tools which form the basis of 
the craft of theater (darkness/light, silence/sound, stillness/movement) and 
by identifying extended tools such as boundaries, depictions, and role play, 
Sean was expanding his use of, and comfort with, elements that drama 
practitioners have suggested to educators for decades. 


Imaging and Depictions as Powerful Tools 


A significant breakthrough for Sean in the class on which this article focuses 
was the realization that the artwork created by the children not only 
provided direction for the drama, but was in itself a powerful way of visualiz- 
ing or imaging. Several of the classes evolved from pictures the children had 
drawn, and the drama often led to the creation of pictures with powerful 
messages. Children were both suggesting new dramas and doing artwork as 
a result of the dramatic activity. A creative cycle was generated. Sean’s work 
at the center strongly supports Haine’s contention that drama is image work 
(Haine, 1985) that draws on both conscious and unconscious imaginative 
capabilities. 

Sean used the powerful strategy of imaging carefully. He guarded the 
children by using precautions as to the side-coaching, the images seeded, the 
use of questioning, and time for debriefing. 


Informal Drama Informs the Classroom Teacher 


During the entire project, Sean seemed to struggle with the concept of the 
drama teacher who plied his art in a formal school situation transferring his 
activity to the informal afterschool setting. A prevailing traditional class- 
room atmosphere was identified as a possible problem early in the project. 
However, Sean felt that working with a few children was much different 
than working with the larger number in a regular classroom. He questioned 
what constituted classroom-type behavior and expressed concern about how 
to get away from hand raising and other classroom traditions, as he worked 
more informally with the children. He felt that the kind of atmosphere he 
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wanted to establish went hand-in-hand with how he wanted the children to 
respond to him and to each other. He felt that the children in the drama class 
were very much on task and quick to respond, and that this was very much 
connected to the way that he interacted with them. This was also true in his 
traditional classroom, so Sean concluded that teacher-student interaction 
was of greater importance than group ritual or rules. He seemed to need a 
balance between formal and informal; certain structures were required in 
order to have a productive class session, but the atmosphere had to be more 
intimate and open for sharing. 

Another problem that plagues formal and informal situations is atten- 
dance. This problem was present throughout the drama project and was 
perceived by Sean as contributing to discontinuity. He also saw it as a 
contributing cause of disharmony and conflict. Children who missed sessions 
were not inclined to pick up the drama without questioning it on their return, 
and some children refused to go along with decisions that had been made in 
their absence. Sean realized that regular participation played a large part in 
the level of engagement in the drama, as well as in the harmony and spirit of 
collaboration within the group. 

How did Sean grow as he experimented with drama as a teaching tool? He 
clearly recognized that if children are to fully participate and engage in 
drama experiences, they must be empowered to develop stories, use purpose- 
ful language, and have control of their actions. The control must be placed in 
the children’s hands, and the stories and language should come from the 
children. Sean identified the drama teaching strategies by returning to his 
own roots as a drama student and by experimenting with theatrical tools 
which transformed the space in which the children worked and gave struc- 
ture to the drama experiences they had each week. Sean extended his use of 
visualizing or imaging. Children’s artwork focused many of the drama ses- 
sions as Sean imaged with the children and drew images from their depic- 
tions. 

Finally, Sean, as a traditional classroom teacher, found many ways in 
which the afterschool drama program informed his classroom teaching. 
Classroom behavior, attendance, teacher role, and student involvement all 
became part of his weekly exploration. Sean understood that if he was to 
work effectively with children in a classroom setting he must adjust his 
teaching to accommodate some of these new awarenesses that he had devel- 
oped in the afterschool drama project. Sean would agree with Verriour (1985) 
that: 


At a time when many educators are becoming increasingly dissatisfied with 
overly prescriptive models of teaching, drama is emerging as a powerful mode 
of learning through which students are encouraged to gain mastery over their 
own thinking. (p. 185) 
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Food for Thought: Realities of a 
Women-Dominated Organization 


This article reports on a case study of nurse educators in a Canadian hospital 
school of nursing. The study asked: Would organizational behavior in an or- 
ganization staffed largely by female professionals be significantly different from 
the typical, predominantly male, professional organization? The significance of 
the study for the organizational and administrative literature is presented. Using 
grounded theory, four major categories generated from the data are discussed. 
The data suggest that in a women-dominated organization a distinct view of the 
world emerges in the workplace that has major implications for organizational 
and administrative researchers and administrators. 


Would organizational behavior in an organization staffed largely by female 
professionals be different in significant ways from the typical, predominantly 
male, professional organization? In the past, most of the theorizing about 
organizations reflects an aspect of the social world that has been dominated 
largely by men. Research into the administration of organizations has gene- 
rally been carried out by men, in settings where men are the majority of 
actors, using strategies which are meaningful from a male perspective. 

Research into educational organizations has been no exception even 
though there are significant numbers of women in these organizations. The 
research indicates that the male perspective pervades most of the approaches 
to administration and organizational research in education. 

Only recently in the organizational and administrative literature has the 
idea been acknowledged that women may bring distinctive views of the world 
to the workplace. The ways women’s views might influence organizational 
culture have not been explored in research. A specific arena in which the 
influence of a female perspective might best be examined is in an organiza- 
tion staffed largely by women. Because 98% of nurses in Canada are female 
(Okrainec, 1986, p. 16), and nursing is one of the oldest and largest women- 
dominated professions, studying a nursing organization is ideal for exploring 
administrative aspects of a women-dominated organization. This article 
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reports on a case study of a Canadian hospital school of nursing. It attempts 
to reappraise old administrative theories, to concentrate on aspects of ad- 
ministration that may have been ignored or taken for granted, and to en- 
courage an alternative mode of thinking about organizations. It is hoped that 
the current work will contribute to a more inclusive understanding of the 
work of women in organizations in general and have implications for the 
study of educational organizations. 


The Context of the Study 

In the past, researchers have made little attempt to include females in their 
studies. Generally, males “have assumed implicitly a kind of sexual sym- 
metry” (Bernard, 1981, p. 14), that is, everything that exists for males also 
exists for females. For example, two instruments that in the past have been 
frequently used in educational administration research, the Leader Be- 
haviour Description Questionnaire (LBDQ) and LBDQ-XII (Halpin & Winer, 
1957), are both sexist in theory and construction. “Using the male pronoun 
throughout and validated with men, these instruments have been used to 
judge the performance of female administrators” (Shakeshaft, 1981, p. 22). 
There has been little attempt to adapt these instruments to female ad- 
ministrators. 

Kohlberg (1981) assumed sexual symmetry in his cognitive development 
model of moral reasoning which is largely based on a study of 84 boys whose 
development was followed for 20 years. He assumed that his findings could 
be generalized to girls; however, Gilligan’s (1982) and Lyons’ (1983) research 
findings concluded there was a difference in the moral development of girls 
and boys. In referring to sociological investigation, Daniels (1975, p. 346) 
suggested that women were excluded from consideration in research studies 
because male scholars believed that what women did was trivial and there- 
fore unworthy of scientific exploration. This point could also be made of 
administrative investigation. For example, most research in educational ad- 
ministration has either left females out of the sample but generalized the 
findings to both genders or has included females and attempted to account 
for the effects of gender (Shakeshaft, 1987, p. 162). However, the research 
results are biased by the androcenterism of the original research (Shakeshaft 
& Nowell, 1984, p. 200). Spender (1981, p. 162) contended that, “the scale on 
which women have been omitted from educational consideration is massive.” 
For example, in surveying the sociology of education literature in Britain, 
Acker (1980) documented the absence of women (cited in Spender, 1981, pp. 
162-163). Of 184 articles published since 1960, 58% purported to be studying 
both sexes, but 37% had exclusively male samples while only 5% had all- 
female samples. Often these female samples were used to study mothers’ 
influence on their children. 

For the most part, knowledge of females has been limited to how they 
impinge on males. The neglect of the study of women, including women in 
organizations (Gould, 1980, p. 459), imposes limits on our intellectual 
horizons. As Gould stated, “regrettably absent are descriptive case studies 
and interpretive analyses of roles women play as creators or innovators of 
organizations” (p. 459), 
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Conducting the Inquiry 
In order to investigate the role women play in organizations, the faculty in a 
Canadian hospital school of nursing were selected for this study. This or- 
ganization was selected because such educational institutions annually grad- 
uate the largest number of nurses. Also, hospital schools of nursing come 
closest to being a cloistered female world; they typically are run by females 
and predominantly populated by females. 

The school of nursing consisted of 220 students and 40 female faculty 
members, all of whom had at least baccalaureate-level preparation. Three 
faculty members also had master’s degrees. The ages of the instructors 
ranged from 30 to 50 years. The marital status was 21 married, three 
divorced, 12 single, and one who described her living arrangement as cohabit- 
ing. The curriculum was taught by three teaching teams. Most of the activity 
in the school revolved around these teams. Although the study included all 
faculty members of the school, six faculty members served as key par- 
ticipants. When faculty members were asked to volunteer to serve as key 
participants, five instructors responded. The investigator asked an addition- 
al faculty member to participate. She was a full-time member of one par- 
ticular team, married with children, two aspects not fully represented in the 
self-selected sample. 

The key participants varied in age from 30 to 43. One was married with 
two children; one was a single parent; one was divorced; and the other three 
were single. These participants represented all teams. One participant repre- 
sented the faculty support services. The educational background of the six 
key participants was as follows: three hospital diploma graduates, two college 
diploma graduates, and one generic nursing baccalaureate degree graduate. 
Two participants had university education in other non-nursing disciplines. 
Five had worked for several years in a variety of hospital nursing units before 
returning for a post basic baccalaureate degree in nursing. Four were cur- 
rently enrolled in university courses toward a master’s degree. The role of the 
six key participants (Cecile, Janine, Martine, Odile, Simone, & Thérése) was 
crucial. They shared important information about the organization and ad- 
ministration of the school and helped the investigator focus on vital data. 

During the 12-month period in which the investigator was collecting the 
data, she attended meetings of all three teaching teams, full faculty meet- 
ings, and other committee meetings and spent considerable time with faculty 
members in the faculty lounge during coffee breaks and lunches. Data mainly 
consisted of three types: copious documents distributed to faculty, tran- 
scribed audiotapes of interviews with key participants, and field notes. Two 
interviews lasting from 60 to 90 minutes were conducted with each of the six 
key participants, one during the first six months and the second during the 
last month of the study. The key participants either confirmed the results or 
noted exceptions to the themes tentatively proposed by the investigator. The 
grounded theory approach was used to analyze the field notes out of which 
were generated five categories that pertained to the organization and ad- 
ministration of the school. Four of the categories are discussed here. The fifth 


359 


P. Valentine and G. McIntosh 


category was omitted because it was of a different order than the other 
categories. 


The Findings 
Food as a Facilitator of Relationships 


Food was a category that liberally peppered the 254 pages of single-spaced 
typed field notes. This category was thoroughly discussed during the two 
dialogues with the six key participants. The discussing, preparing, cooking, 
and eating of food were strong elements of this culture, and such topics often 
dominated conversations over lunch, coffee breaks, and during meetings. 
Food was seen as the panacea for many situations. It was used for celebrating 
births, birthdays, and marriages, for dealing with stress, for celebrating the 
departure of staff and faculty members, for expressing appreciation to a 
team for supporting a new instructor, for thanking hospital staff for working 
with students, for introducing new students to the school faculty and the 
administrators of the hospital, for celebrating the end of the term, for cele- 
brating various graduation events, for rewarding a student who participated 
in a school project, for Christmas gifts, as the impetus for exchanging recipes 
with colleagues, and as a reason for dieting. 

Food was seen as a lubricant for facilitating smooth colleague-colleague or 
faculty-student relationships. Initially, the investigator was struck by the 
number of times she arrived in the lounge or the meeting room to find some 
kind of goodie on display. For example, at the coffee break of the first team 
meeting of the term, three homemade desserts were served to first-year team 
members. 

Conversation often revolved around food. For example, the best buys in 
bulk food were discussed. Recipes were frequently talked about and ex- 
changed among faculty members and staff and were often posted on the 
bulletin board in the faculty lounge. During one meeting two duplicated 
recipes were distributed to every member including the investigator. One 
team celebrated each team member’s birthday with a cake. 

Food became the reason for faculty getting together in groups after work. 
Some faculty members met during the summer to can peaches and prior to 
Christmas to prepare poppycock and antipasto for gifts. Baking for 
Christmas was discussed by several instructors, one of whom brought in a 
sample of Norwegian cake she had served at Christmas. One faculty member 
who was taking a course on bread-making brought in samples of bread for 
her colleagues to taste. During one lunch hour, there was considerable discus- 
sion about how to make cabbage rolls. At another break, one instructor 
talked about making 600 perogies over the weekend. At another time, the 
preparing of pastry from pig’s fat from a colleague’s farm, the making of both 
homemade pasta and yogurt, and the buying of fresh sausages from a local 
farmer were discussed by four different instructors. 

During the dialogues, when the investigator asked the participants about 
the meaning of this emphasis on food, they replied: 


Odile: Like its a facilitator, almost communication, socializing, everything 
from the organizing of it to the eating of it. 


Realities of a Women-Dominated Organization 


Thérése: Maybe work, and the people here [are] a large part of people’s social 


life and so they don’t see it as an imposition because they are bringing food for 
their friends. 


Simone: I think that here it represents being social, bringing the group 
together, you eat together, you’re friends, you break bread ... we are all really 
good pals, we get along so well. 


Martine: Well, I think [for] women ... [itis a way of] establishing camaraderie. 


Janine: As far as having any great significance the food ... it’s a way of taking 
a few minutes out to do something that people enjoy and relax over.... It is a 
way of coping with stress. 


Cecile: The underlying reason for suggesting a drink or lunch or a breakfast 
meeting.... It is more informal and if everybody made the effort of going and 
sitting beside someone that they didn’t spend much time with, that maybe 
they would feel a little bit more comfortable with that person and, therefore, 
be able to express some feelings about working with them or be comfortable in 
giving constructive feedback. 


Social Events as Integrators 


Food and social events blended together and complemented each other. From 
the time the investigator started her field work, she found that instructors 
were involved in many social events, some of which were informally con- 
sidered mandatory while others seemed to be optional; some were directly 
related to the students while other events were faculty related. For example, 
there were baby showers, a shower for an instructor who was getting mar- 
ried, parties for those faculty/staff who were leaving, a “roast” for an ad- 
ministrator who resigned, a coffee party for students who had worked there 
during the summer, and coffee parties for two other people who had resigned. 
Other events were potluck suppers for the various teams, a faculty-student 
barbecue, a volleyball tournament, a Halloween party, a Valentine’s Day 
party, numerous student-faculty events leading up to graduation, several 
functions to celebrate the Christmas season, as well as coffee parties or teas 
for the hospital staff who helped with the nursing program and end-of-term 
parties for the entire faculty and staff as well as separate functions for each 
team. 

During the 12-month period the investigator was in the school, there were 
some 55 social events. Although these events were not mandatory for all 
instructors, and some were limited to particular teams, nevertheless the 
faculty were expected to attend a large number of events, often after work 
hours. 

Comments from three participants who attempted to explain the meaning 
of this category were: 


Janine: The year I started ... we had a potluck supper or some kind of a 
get-together among the first-year faculty. The stress level was really, really 
high ... we had a social thing about once a month. It seemed to be a time when 
we could have a laugh, we tried to minimize the work or the shop talk ... to 
appreciate people as people, not just as a teaching partner ... it’s a way of 
sharing something with other people in the group ... I think it’s a socialization 
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... to help try and reduce the stress somewhat to get to know each other 
better. 


Cecile: Well ... the social events that occur here bring the faculty closer 
together. 


Odile: The reason that social events are planned is an attempt to help 
establish happy relationships amongst faculty ... that socially doing things 
together is seen as a way ... of being ourselves, getting to know people... 
Certainly ... that’s one of the things on the agenda at the retreat every year 
and ... from my perspective that gets fostered. 


Support Mechanisms as Essential 


Food and social events acted as mechanisms for promoting social interaction 
as did support mechanisms, the next category to be discussed. Support 
mechanisms evolved as another category that characterized the school. It 
was not unusual for a faculty member to ask for support from her colleagues. 
For example, during one team meeting, the coordinator, after talking about 
how the stress level of faculty increased as the year progressed, added, “We 
need each other’s support.” Another coordinator thanked all the team mem- 
bers for their “support and assistance” when the team was short one mem- 
ber. She also discussed methods for coping with the “minimal support” 
accorded instructors during the supervision of students on the hospital units. 
At one team meeting when a coordinator announced that she had agreed, at 
the request of the students, not to post certain examination results, she asked 
for team support on this decision. All new instructors were assigned a 
“buddy”—a supportive person who would help to orient them to the school. 
These buddies helped them prepare lectures, attended their classes, and 
aided them with any problems they experienced in the clinical areas. Also, 
senior students in the preceptorship experience were “buddied” with hospital 
nursing staff members. 

The coordinators often demonstrated support by validating marks on 
papers, helping with evaluations, and generally aiding new instructors. A 
more seasoned instructor requested that the coordinator mark her papers to 
give her feedback (support) on her marking ability. A new instructor brought 
home-baked cookies in to thank the team for their support. Comments made 
to support an instructor when she was receiving feedback on her class were: 
“great examples”; “enjoyed the class”; “a capital job”; and, when one team 
was reviewing the year’s progress, “I think we did well.” 

Support was given to mothers when their children were sick. On two 
occasions the teaching schedules were changed to allow mothers to attend to 
their sick children at home. The emphasis on support carried over to the 
students. For example, one faculty member contended that during a senior 
rotation when the students were relatively autonomous and received little 
faculty supervision, the students “mainly wanted support.” The male nurs- 
ing students were paired together for support. 

When the investigator asked about the need for support in the work 
situation during interviews with the key participants, they all readily agreed 
that if they did not have at least one supportive colleague at work, they would 
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feel “isolated.” Simone told the investigator about a new faculty member who 
felt alienated from the other instructors on her team; this instructor “hasn’t 
found very much of that support at all ... and is finding it quite difficult... I 
know she feels quite out of place in any social event or meeting where the 
whole group is together.” 

As Simone explained: “If you play the game [ask for help], then they 
support you too.” However, there is evidence that she failed to follow this 
creed in her comment that she “[had] probably alienated the instructors from 
[her specialty area]; I really don’t have any support at all.” Because she had 
other school colleagues who were supportive, she did not find this situation 
devastating. 

Janine commented that 


as a faculty we sort of strive to develop more than just working relationship|[s] 
but, to a certain extent, personal [relationships] or friendships that ... don’t 
necessarily go beyond the work setting but at least at work people are friendly 
with each other. 


She added that “some people find their support from their students. I think 
that’s an important aspect as well, but I don’t think you can always rely 
solely on the students.” 

Martine described the small supportive groups that formed within the 
faculty as a “very informal thing.” There were people that she felt close to 
philosophically: “I do rely on them for support, but I don’t think it’s a formal 
thing.” When Cecile joined this faculty she said the “atmosphere was calm, 
relaxing and ... people were very supportive.” Odile agreed there was the 
“opportunity to build supportive relationships at all the social functions but 
she indicated there was a “fair amount of ‘cliquishness’ at these events.” 
Thérése reported that at meetings faculty needed supportive feedback about 
their teaching. 


Meetings as Cohesion Builders 


Just as supportive relationships were crucial for the nurse educators, meet- 
ings, the next category to be discussed, also acted as cohesion-building mech- 
anisms. “Correlation” meetings were the major mechanism for getting the 
school’s work done. Although these meetings were mainly used as mecha- 
nisms for sharing curriculum information, they also had other unwritten 
objectives. For all teams, correlation meetings were held twice a month and 
usually lasted between 90 minutes and two hours. The general format of 
these meetings consisted of instructors giving feedback on classes they had 
conducted the previous two weeks, highlighting areas well done and suggest- 
ing changes where appropriate. At this time, team members responded with 
comments, especially if they had attended these classes. It was common 
practice for faculty members to attend each other’s classes. Often instructors 
brought verbal feedback from student groups. When instructors distributed 
written evaluation forms, these were reported in summary form. 

Feedback sessions were followed by correlation. During this part of the 
meetings, the group of instructors who were presenting classes in the follow- 
ing two weeks would give a summary of their presentations. This summary 
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included instructions for other faculty members who may be teaching some 
of this content in small groups, during laboratory exercises, or on the hospital 
units. Handouts, considerable in number, were circulated. These summary 
presentations always stimulated comments and questions by team members. 
The investigator’s field notes indicated that the participation rate was high. 
For example, at one first-year meeting all but two instructors commented. At 
two third-year meetings, nine and seven instructors respectively made com- 
ments out of a total of nine team members. At two separate second-year 
meetings, all but two of the 13 team members participated in the discussion. 
When the investigator asked the five key participants whose duties called 
for attendance at correlation meetings to explain these events, they replied: 


Martine: On the surface it is informing instructors who don’t necessarily 
attend the classes themselves, what exactly is being taught ... to make sure 
that we are all touting the same party line. 


Thérése: When we discussed what was taught last week and how did the class 
go, usually there are general comments, what did the students have to say, do 
we need to make changes in it for next year—it is easier to do at the time 
because six months later ... it is long forgotten and so that kind of evaluation 
.. lS more meaningful and then many of the faculty members go to some 
classes ... so they will comment also. 


Janine: It’s done to keep everybody informed of what is being taught, dif- 
ferent strategies that are being used in presenting that material to the 
students ... what types of strategies have either worked well or are being 
planned to be used.... It was hoped that it would promote consistency among 
faculty. If you know what has been taught in class, then you can check to see 
how students are performing as compared to what was taught. 


Cecile: We do that because we need the feedback from the past lecture that we 
have given as to how to improve and what the students need a bit more of or 
we need some positive feedback too if we did a good job.... It allows the 
instructor, who doesn’t necessarily attend the lecture, to decide whether they 
should attend the lecture or prepare themselves in some way for the content 
so that they can work with the students. 


Simone: It was originally set up in order to get feedback on faculty to improve 
but I think now it’s just disintegrating to giving information, because the 
comments made, usually someone will say “the class went well. I need more or 
less time.” 


Three key informants concurred when the investigator suggested that 
correlation meetings served as integrating mechanisms, along with food and 
social events. The other three informants were not queried about this phe- 
nomenon. 

Do the above data suggest that in a women-dominated organization a 
distinct view of the world emerges in the workplace? How does this relate to 
the research literature? 


Conceptualizing the Female World 
Because women, for the most part, have been ignored in the research litera- 
ture, conceptualizing the female world is difficult. One term that attempts to 
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conceptualize the female world is “women’s sphere.” This term can be traced 
back at least as far as 1792 when Wollstonecroft (1891, p. vii) suggested this 
was a “separate” sphere that had about it a sense of “roundness,” an enclo- 
sure that was “cozy, protected, safe” (Bernard, 1981, pp. 80-81). 

The ideology of women’s sphere helped to support the belief that women 
were not intrinsically inferior to men but their sphere had to be kept in a 
subordinate position in the interests of the national welfare (Bernard, 1981, 
p. 91). The work carried on in women’s sphere also had a major contribution 
to make to society in the running of households and the raising of children. 
Women’s sphere was a place for nurturance against the outside world and 
was responsible for transmitting moral standards to society. It was believed 
that the home, along with moral and charitable activities, was “the natural, 
normal and only concern of women’s sphere” (Bernard, 1981, p. 91). 

In the 1830s, “nursing was seen as an innate and loving responsibility of 
women and mothers”(O’Brien, 1987, p. 12). Home was supposed to be a place 
of cheer and comfort with women as the major comforters: 


One of the most important functions of woman as comforter was her role as 
nurse.... The sickroom called for the exercise of higher qualities of patience, 
mercy and gentleness as well as for her housewifely arts. She could thus fulfil 
her dual feminine function—beauty and usefulness. (Welter, 1966, p. 163) 


Nursing, a women-dominated profession, clearly reflects this concep- 
tualization of women’s sphere. According to O’Brien (1987): 


Nursing was still very much a domestic art in the mid-nineteenth century. To 
practice it in the home was to fulfil the calling of “true womanhood.” To 
practice it for a wage kept one within the domestic sphere, a world where the 
only pertinent analogy was to the “chambermaids.” (p. 14) 


Around this time (1860s), the “new” profession of nursing was established in 
England by Florence Nightingale who insisted that women from the upper 
classes could make a respectable career out of nursing. In order to fit the 
Victorian era’s expectations of women, Nightingale “encouraged the myth 
that women’s [nurse’s] work outside the home was not considered ‘real work’ 
but merely an extension of home work” (Seymer, 1960 cited in Coburn, 1974, 
p. 156). This image of nursing was transmitted to North America (O’Brien, 
1987, p. 16). 

The ideology of women’s sphere was not easy to put into practice for 
women or nurses. Although this ideology was a “realm of the heart,” “women 
had to foster the ‘bonds of womanhood,’ the sisterhood, the support systems 
that were its essence and made it bearable in a world not always congenial or 
supportable to women” (Bernard, 1981, p. 91). Because women were prohib- 
ited from any significant involvement in the polity or the economy and were 
excluded from equal partnership with men, women survived by giving emo- 
tional sustenance to each other (Bernard, 1981, p. 91). 

Participation in women’s voluntary organizations was one way of pro- 
viding this sustenance. According to historians, 19th-century women in 
Canada and the United States had a strong propensity for cooperating with 
each other in voluntary organizations (Armeny, 1983, p. 14; Coburn, 1974, p. 
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133; Shaw, 1957, p. 37-75). “One reason for [the] proliferation of women’s 
groups was the sense of a special female identity, created, or at least inten- 
sified, by the gap between men’s and women’s experiences and responsibili- 
ties in an increasingly specialized urban and industrial economy” (Armeny, 
1983, p. 14). A common female heritage led to alliances between Canadian 
nurses and philanthropic women that flourished during the latter half of the 
19th century and the early part of the 20th century. One such alliance 
occurred in the National Council of Women of Canada (National Council), 
the most important Canadian national women’s organization, which was 
established in 1893 (Cook & Mitchinson, 1976). The Council “staunchy 
support[ed] the interests of Canadian nurses in war service, as in peacetime” 
(Shaw, 1957, p. 157). Nurses’ alliance with such women’s groups was based 
mainly on the precepts of “a sense of women’s solidarity in good works” 
(Armeny, 1983, p. 33). 

As indicated above, nursing developed in concert with the female world. 
The ideology of women’s sphere influenced the development of nursing. In 
order to try to bridge the gap between the “cult of true womenhood” and the 
work world of nursing, the fit between nursing and women’s special sphere 
was developed. Nursing was portrayed as merely an extension of women’s 
nurturing role at home along with the passive attributes associated with it. 


Characterizing the Female World 

A starting point for characterizing the female world is provided by polarities 
used by male social thinkers to explain the transition from the rural, prein- 
dustrial world to the urban, industrial world in the 19th and 20th centuries. 
These polarities were not based on gender differences and predated the 
female world conceptualization. Bernard (1981) is able to relate these polar- 
ities to both the female and male worlds. For example, in 1861, Maine 
(Bernard, 1981, p. 24) contrasted societies by distinguishing between those in 
which relationships were based on status and those in which relationships 
were based on contract. Relationships that are based on status are charac- 
terized by ascription and affectivity. Duties, obligations, and privileges are 
assigned to certain positions in society. Everybody’s needs are taken care of 
to the best extent possible. Relationships based on contract, however, involve 
duties, obligations, and privileges that are specified and agreed to by the 
involved parties (Bernard, 1981, p. 24). The former describes the female 
world, the latter the male one. 

Ténnies (1940, pp. 37-38) described two forms of human will as the basis 
of all social relationships. The first is “natural will” which is the basis for 
Gemeinschaft. Natural will spontaneously expresses the human’s inner 
wishes and demands and sees affiliation with others as an end in itself. 
Rational will, the second one and the foundation of Gesellschaft, denotes 
deliberation or calculation of both means and ends, viewing both these 
conditions as separate. Women’s relationships are generally based on natural 
will while men’s relationships are more closely associated with rational will. 

Boulding (1969) contrasted the economy with the “integry.” The economy 
is described as a world in which monetary exchange acts as an intermediary 
in human relationships, a description more akin to the male world. The 


362 


Realities of a Women-Dominated Organization 


integry, strongly associated with the female world, is defined as “that part of 
the social system which deals with concepts and relationships such as status, 
identity, community, legitimacy, loyalty, love, trust” (p. 4). Boulding defined 
the integry as a “distinctive segment of the social system with a dynamics of 
its own, and strong interactions with other elements of the social system” (p. 


4). 


Aspects of the Female World 


Gemeinschaft. Out of these polarities, Bernard (1981, p. 28) developed 
two primary characteristics of the female world. The first is the kinship- and 
locale-based or Gemeinschaft nature of the structure of the female world— 
“the realm of life and work in Gemeinschaft is particularly befitting to 
women: indeed it is even necessary for them” (Ténnies, 1940, p. 186). In 
1899, Veblen (1953, p. 21) discussed the gender-related aspects of society by 
indicating that women still maintained an “archaic view of things” that 
prevented them from having a direct relation to the industrial society of that 
time. Seventy-five years later, Scheppele’s research supported Veblen’s ob- 
servation by pointing out that the transition from Gemeinschaft to Gesell- 
schaft generally bypassed women (Scheppele, cited in Bernard, 1981, p. 28). 
In the movement from the rural to the urban areas, women lost the produc- 
tive duties they had performed on the farm. This was replaced by a division 
of labor where the wife became a housewife responsible for the house and 
children while the husband went out to the world of work. They did not 
establish relationships with groups of people who were unknown to each 
other. Instead they lived predominantly in Gemeinschaft, where women have 
always maintained a pivotal role in sustaining relationships among relatives 
and upholding family traditions that have been the mainstay of Gemein- 
schaft (Scheppele, cited in Bernard, 1981, p. 28). Bernard pointed out that 
the portrayal of the female world as Gemeinschaft rests on “evidence from a 
respectable research literature which documents ... that the world of women 
has indeed been a kin- and locale-based world—a world of Blut-und-Boden 
[blood and soil] that performs an integrating function (p. 28).” 

The roots of nursing derive from the Gemeinschaft orientation. Several 
authors have indicated that nurses’ roles have been strongly associated with 
the roles of mothers (Apple, 1988, p. 47; Delamonthe, 1988, p. 345; Fagin & 
Diers, 1984, p. 16; Fitzpatrick, 1977, p. 821; Glass & Brand, 1979, pp. 38-39; 
MacPhail, 1988, p. 48): 


During the nineteenth century, health institutions borrowed their model 
from the bourgeois family where the father dominated and performed the 
role of decision-maker (doctor, administrators), while the mother’s expected 
role consisted of “serving” activities, carrying out the wishes of the father 
(1.e., nurses) 


and patients were relegated to the role of children (Colliere, 1986, p. 103). 
During the 20th century, Ashley (1977) described the role of hospital nurses 
as one of caring for the “hospital family” (p. 17). “Like mothers in a house- 
hold, nurses were responsible for meeting the needs of all members of the 
hospital family—from patients to physicians” (p. 17). O’Brien (1987) main- 


363 


P. Valentine and G. McIntosh 


tained that historically nurses’ ability to “link home and hospital” (p. 16) was 
the main source of status and prestige of the profession. Coburn (1974) 
stated that “the ‘ideal lady’ transplanted from home to hospital was to show 
wifely obedience to the doctor [husband and] motherly self-devotion to the 
patient [child]” (p. 155) while Colliere (1986) contended that nurses’ “subor- 
dination to others’ thought and knowledge, and the duties toward everyone, 
were the appropriate rules to follow” (p. 102). 

Nurses have traditionally been the integrators in the hospital culture. 
Just as housewives usually occupy the pivotal position around which other 
family members revolve, nurses are the professionals who remain on the 
hospital units while other health care personnel move about freely. This 
results in other health care professionals ascribing to nurses the role of 
coordinators who are responsible for integrating the various components on 
health care. 

Love-and/or-duty ethos. The second characterizing aspect developed by 
Bernard to describe the female world is the love-and/or-duty ethos. The 
integry as described by Boulding (1969, p. 4)) is characterized by the love- 
and/or-duty ethos. Sorokin (1969, p. 21) in his studies of altruistic love in 
American society found the integry inhabited almost exclusively by women. 
Because women are expected to display altruism and compassion in their 
traditional roles as wives and mothers, choosing a helping profession such as 
nursing fits the stereotype of the appropriateness of nursing as women’s 
work (Lewin, 1977, p. 88). 

The role of mother is based on the love-and/or-duty ethos and plays an 
important part in shaping the female character (Bernard, 1981, p. 164). The 
mother is expected to nurture her child out of love-and/or-duty. Another 
characteristic of the love-and/or-duty ethos is selflessness. Samuels’ (1975) 
review of the literature on working-class women revealed that the predomi- 
nant values of these women were “duty, sacrifice, repression, denial of 
self-interest, self-expression and self-gratification” (p. 9), all characteristics 
of a “good” mother. 

“Nurturance, self-sacrifice and mothering” are characteristics commonly 
associated with nurses (MacPhail, 1988, p. 48). Nursing places great em- 
phasis on “nurturing and mothering” (Wilson, 1971, p. 217). “Womanhood 
was equated with motherhood ... and mother [was] equated with ‘nurse’” 
(Church & Poirier, 1986, p. 104). According to Roberts and Group (1973, p. 
318), nursing pioneers emphasized that hospital work required the “selfless 
devotion of mother” while Coburn (1974) stated that “nursing the ill ... [was] 
‘esteemed’ as [a] familial labour of love” (p. 155). Nursing was considered 
merely a supplement to the primary duties of housewife. Low wages and poor 
working conditions were accepted because nursing work was considered a 
public service or “labour of love” (Coburn, 1974, p. 155). “Nursing was to be, 
therefore, a woman’s duty not her job. Obligation and love, not the need of 
work, were to bind the nurse to her patient. Caring was to be an unpaid 
labour of love” (Reverby, 1987, p. 6). 

Although avant-garde women at the turn of the century recognized that 
the love-and/or-duty ethos was exacting a high price on women, attempting 
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to change it has proved to be a difficult task. That is, a large proportion of 
women conform, or at least try to conform, to the requirements of this ethos 
at whatever the cost. The love-and/or-duty ethos is nonegalitarian, demands 
a serving role, and expects those following it to be compliant and constantly 
serving others. “Stroking” best describes the behavior called for in the love- 
and/or-duty ethos. Stroking refers to “giving, supportive, compliant” (Ber- 
nard, 1981, p. 374) behavior such as “showing solidarity, raising the status of 
others, giving help, rewarding, agreeing, concurring, complying, understand- 
ing [and] passively accepting” (p. 502). Nursing epitomizes these “expres- 
sive” traits (Coburn, 1974, p. 155; Fagin & Diers, 1984, p. 16; Johnson & 
Martin, 1958; Roberts, 1983, p. 27; Wilson, 1971, pp. 218-219). 

Other aspects of the love-and/or-duty ethos relate to the manner in which 
women function in organizations. While men’s peer relationships, for the 
most part, have emphasized competition and decision making based on a 
hierarchical structure (Kalisch & Kalisch, 1982, p. 23), women’s relation- 
ships have largely been based on the love-and/or-duty ethos. Women in public 
service administration, according to Neuse’s research (1978, p. 440), were 
less committed to formal hierarchy but instead were more willing to sub- 
merge their personal power in hierarchical ranking. Women “were more apt 
to develop lateral relationships and cross-relationships,” than were men 
according to Ortiz and Marshall (1988, p. 183). Fairholm and Fairholm’s 
research findings (1984) concluded that female school administrators tended 
to favor tactics such as “coalition building, cooptation, and the use of their 
personalities” (p. 72) to get the work done. The data on nurse educators 
supported the Fairholms’ findings. Food and social events were used to 
facilitate relationships among the nurse educators, to “bring faculty closer 
together” to foster common action. 

Charters and Jovick (1981), Fishel and Pottker (1975), and Meskin 
(1974) found that female principals stressed cooperation in interpersonal 
relationships that resulted in a nonstructured, participatory style of decision 
making that encouraged inclusiveness rather than exclusiveness (Kanter, 
1977). Women’s consistent desire for consensus in decision making was 
supported by Dumas (1985, p. 98), a nurse. 

According to Shakeshaft (1987), several studies have found that “women 
[were] perceived as being more democratic and participatory than [were] 
men” (p. 187). For example, Hemphill, Griffiths, and Frederiksen (1962, p. 
333) found that female principals involved more teachers at all levels as well 
as outsiders in their in-basket problems while Hines and Grobman (1956, p. 
100), in their self reports of principals, found female principals to be more 
democratic than male principals. One conclusion of Charters and Jovick’s 
(1981) research was that “more participatory decision making appeared in 
female-managed schools” (p. 322). The data on the nurse educators indicated 
that in all three teaching teams, there was a high rate of participation by all 
team members and decisions were usually made by consensus. This is not to 
suggest that all women are more participatory in this approach to manage- 
ment than all men. 
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The administrative research literature does not identify a need for sup- 
port or the need for at least one supportive relationship at work as the 
findings from the study of nurse educators indicated. However, Gilligan’s 
(1982) and Miller’s (1976) research suggested that women develop “in a 
context of attachment and affiliation with others” (p. 83). Therefore, suppor- 
tive relationships among women may be an important part of the work 
environment. 

Bernard’s (1981) and other authors’ depictions of the female world sug- 
gest that any attempt to characterize this world simply means that “the 
norms—legal, conventional, moral, whatever—prescribe, permit or at least 
tolerate this behaviour” in females (p. 30). It does not mean that all female 
members conform to every item described above because, to date, there are 
insufficient data from which to develop a comprehensive and unified portrait 
of the female world (p. 30). “Neither does it imply that all the women in it are 
angels or saints or loving, serving goddesses, or embodiments of the ‘true 
womanhood’ celebrated in women’s sphere in the nineteenth century” (p. 
504). 

All the characteristics described above led Bernard (1981) to conclude 
that the female world is “an integrating system with as much conceptual 
validity as say the economy or the polity” (p. 29). Although this integrating 
system, like other aspects of the female (nursing) world, has not been careful- 
ly researched, it is apparent from the above description that it is recognized 
as a part, perhaps a somewhat invisible part, of society. 


Summary, Conclusions, and Implications 

In summary, the nursing world has evolved in tandem with the female world 
exhibiting Gemeinschaft and the love-and/or-duty ethos. The four major 
categories generated from the data exemplified the Gemeinschaft nature of 
the school and the presence of the love-and/or-duty ethos. Nurses have 
traditionally given sustenance to others—patients, physicians, and family 
members. The preceding data suggest that for the nurse educators, the 
categories of food and social events as integrators, support as essential, and 
meetings as cohesion builders exemplify the ways these women attempted to 
nourish each other. Food and social events served mainly as facilitators for 
interaction. Social events, usually accompanied by homemade food, were 
used to celebrate births, birthdays, and marriages, to mark the beginning and 
end of the school term, and to express appreciation for assistance given to 
students. 

Support was a word liberally used by all instructors and was readily 
acknowledged as a necessary ingredient in the school. Being supportive was 
considered an important aspect of course coordinators’ roles. The six key 
participants agreed they needed at least one supportive colleague at work. 
These relationships were more than just working relationships; they strove to 
be friends. The nurse educators valued support and tried, for the most part, 
to be supportive of each other. Meetings as cohesion builders reflected the 
emphasis on facilitating the implementation of curriculum content. They 
also were opportunities for “co-relating.” Meetings were used to dispense 
information, as a method for sharing teaching content and strategies, per- 
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sonal anecdotes, or food, and as a mechanism for supporting team members. 
One objective of team meetings was to meld the teams into smoothly func- 
tioning units. 

In conclusion, the female nurse educators exhibited orientations unique to 
the organizational and administrative research literature. The school had a 
distinctive culture that contained features that could be attributed to both 
the nursing and the female worlds. Examples are the strong need for integra- 
tion as expressed in constructs such as food and social events, the need for 
meetings, and the need for support. The case study method provided a rich 
data source on the world of nurse educators that is exemplified in the 
participant narratives. 

There are several outcomes or recommendations that have implications 
for organizational and administrative researchers and administrators. Orga- 
nizational and administrative researchers need to think differently about 
organizations. Because administrative theory currently reflects a male-orien- 
ted world, there is a need for the incorporation of a perspective consistent 
with the female world so that the theory is truly representative of both 
women and men. 

There needs to be further refinement of the female-world concept and 
identification of other aspects of this world. This might be accomplished 
through more intensive studies of women-dominated organizations, for ex- 
ample, girls’ schools; women’s colleges; hospital, college, and university nurs- 
ing programs; organizations in which the composition results in a clash of 
male and female cultures such as elementary schools; and organizations 
where males dominate such as universities, engineering faculties in par- 
ticular. 

Female culture should be carefully considered by administrators, especial- 
ly administrators in women-dominated organizations such as nursing and 
elementary education. For example, administrators need to recognize the 
dilemmas women face when they have sick family members and develop 
mechanisms to help them cope with this situation. The emphasis on develop- 
ing camaraderie among faculty, facilitating effective relationships between 
faculty and students, and coping with stress through mechanisms such as 
food and social events need to be given special attention by administrators. 
Because this research indicated that it was crucial to have at least one 
supportive relationship at work, mechanisms need to be built into organiza- 
tions to promote supportive relationships among women. Administrators 
generally need to recognize the importance accorded relationships by women 
and develop methods for facilitating relationships in organizations. They also 
need to recognize that supportive relationships that fail to recognize differen- 
ces among women and competition may not necessarily be the best strategy 
for dealing with the realities of the work world. 

Perhaps a result of the better integration of the literature on female 
experiences in organizations, there may be less inclination to give female 
administrators courses and books on how to be more like male ad- 
ministrators but instead an increased emphasis on changing organizations to 
better fit the aspirations and values of the female world. In the longer term, 
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the female perspective may be promoted by administrators as a way to 
improve the “health” of organizations. 
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PERSPECTIVES 


“Who is Sylvia?” Reflections on Personality, 
Pedagogy, and Populism: A Review Essay 


PATRICIA T. ROOKE 


University of Alberta 


Sylvia Ashton-Warner (1908-1984), a New Zealand teacher, created a method of 
teaching Maori children through her “Key Vocabulary” concept. Such a method 
was translated to minorities in Australia, Canada, and the United States. A 
charismatic figure who was adulated as the author of Spinster (1955) and Teach- 
er (1963), hers is a life that illustrates the dynamics between pedagogy and 
personality. 


Criticizing Sylvia Ashton-Warner in the 1970s in North America was as 
unthinkable as criticizing Mother Theresa of Calcutta. (Hood, 1988, p. 214) 


Artists don’t get formed, they’re born. (Sylvia Ashton-Warner) 


In light of Sylvia Ashton-Warner’s dismissal of those psychological, familial, 
and cultural factors usually attributed to the formation of personality, the 
development of character, and the shaping of life course and career, the 
biography of New Zealand’s most celebrated pedagogue makes for interest- 
ing reading. Indeed Hood’s (1988) Sylvia? The Biography of Sylvia Ashton- 
Warner is a fascinating study of a remarkable woman. It is not, however, 
altogether a sympathetic one. 

For those who did not live through the excitements of the ’60s and early 
’70s, Ashton-Warner’s achievements already have faded into obscurity. Alas, 
this is one of the perils of populism. For others, myself included, her pedago- 
gy—as absurdly replicated as it was on occasion—was a clarion call to 
subversion in my Teachers’ College in Australia. Like their inventor, we too 
rather fancifully resisted becoming just another “dreary teacher” in a 
“bloody profesh” (p. 55) that, in all our youthful brashness, seemed to ex- 
emplify the most pedantic and unimaginative aspects of an occupation that 
promised women little advancement. 

Sylvia was born in 1908 in New Zealand, and after a childhood as a 
student in her mother’s classrooms and an apprentice teacher she entered 
Auckland Teachers’ College in 1928. Most of her career was spent teaching 
Maori elementary children in small and remote schools under her husband, 
Keith Henderson, the headmaster. Henderson fully supported her efforts in 
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developing a new reading method published in five installments in National 
Education (December 1955 to May 1956). “The Maori Infant Room—Or- 
ganic Reading and the Key Vocabulary” was subsequently recommended in 
New Zealand Teachers’ Colleges and advocated for use elsewhere. Her teach- 
ing style was so dramatic and her pedagogy so radical that they inspired 
imitation and adoption in New Zealand, Australia, Canada, and the United 
States, particularly by those involved in the teaching of ethnic and racial 
minorities. Largely unrecognized in her native country, it was not until the 
1960s that she was acclaimed as an inspirational and radical classroom 
practitioner (as distinct from theoretician). A pedagogy rather than an edu- 
cational philosophy, her work was based on a belief that emotions, spon- 
taneity, and self-expression were essential to create a moral and aesthetic 
learning ecology. 

She became a celebrated novelist when Spinster (Ashton-Warner, 1955) 
was turned into a MGM movie starring Shirley Maclaine, Jack Hawkins, and 
Lawrence Harvey. A second book Teacher (Ashton-Warner, 1963) which 
expounded her pedagogical views in relation to her classroom experiences 
and conflicts with officialdom was published by Simon and Schuster. Both 
books were seen by single women as a vindication of their status in a 
profession that did little to raise their self-esteem and more generally were 
popular because of their provocative descriptions of an ordinary embattled 
classroom teacher contending with the insensitivities and indifference of 
administrative bureaucrats. The oppression of “the establishment” and “the 
system,” buzzwords several decades ago, was symbolized in oppression by 
“the department” (of Education) in Ashton-Warner’s professional life. Ash- 
ton-Warner in 1970 was invited to assist in the founding of an alternative 
school in Aspen, Colorado, which experiment by the 1980s had proven to be 
among the most successful of its kind in the country. In 1971-1972, now a 
widow whose children were grown, she was invited to take up a visiting 
fellowship at Simon Fraser University where she conducted workshops and 
courses for elementary teachers-in-training. She died in New Zealand in 
1978. 

These, then, are the facts but as historians and biographers know facts do 
not speak for themselves. Most biographers come to their subjects with 
preconceptions about a historical actor who has left his or her mark on 
society. Biography, therefore, is a journey into the past written from the 
vantage points of present insights, and sometimes disadvantaged by these 
preconceptions. This biography is no exception except in one striking par- 
ticular. 

It appears that Hood’s (1988) exploration is as much a vindication of a 
nation as it is a spotlight on the perils of populism in the life of an extraordi- 
nary woman. Throughout the narrative the evidence is carefully compiled to 
construct an argument that dissects her subject’s ambivalent relationship 
with New Zealand which Ashton-Warner claimed continued to misunder- 
stand and persecute her even as North Americans were recognizing and 
acclaiming her. One suspects that the motive behind this approach was to 
answer Ashton-Warner’s (1979) I Passed This Way, an autobiographical 
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apologia and vilification of New Zealand. The biographer, it appears, feels 
personally offended by Sylvia’s repudiation of that tiny nation-state—“these 
little islands in the South Pacific” (p. 64). She is anxious that readers not 
believe Sylvia’s criticisms of her own culture. “I’ve hated New Zealand so 
much,” she confessed, “I still hate it and I suppose I always will” (p. 158). 
These bitter words were uttered when her country acclaimed her book 
Spinster. For Sylvia the acclaim had come too late. 

In the final analysis Hood has argued her case so compellingly and 
scrupulously that one is left with the decided impression that Sylvia Ashton- 
Warner was less misunderstood and unappreciated than she was an in- 
dividual with a personality that constituted an unrepentant paranoia. 
Indeed, it is argued that rather than any objective reality the psychological 
status of Sylvia’s personality itself induced a chain of events that contributed 
to repression of her pedagogical ideas by the New Zealand Education Depart- 
ment. What she perceived as sustained opposition to her ideas she internal- 
ized into feelings of “persecution, rejection, and spiritual murder” (p. 246) 
and transferred these feelings into a persona that was imperious, intolerant, 
capricious, manipulative, controlling, and altogether unlovely. This is no 
hagiography. 

The fundamental proposition to her reading method was potent and 
direct! The key vocabulary method was best taught and better learned by 
using children’s vocabulary and by having them create their own primers and 
lessons out of their experiences. “Pleasant words won’t do,” she claimed, 
“respectable words won’t do. They must be words organically tied up, organi- 
cally born from the dynamic life itself. They must be words that are already 
part of the children’s beings” (p. 136). Her method was simple: a daily period 
of free play (music, dance, drama, and art as the media) followed by reading 
activities which revolved around homemade books and wall newspapers 
dictated by the children and printed by the teacher. In short, it was an 
individualized reading program which in many ways did not break new 
ground, whereas others (e.g., the “key” vocabulary) did. Her formula too was 
basic: “Release the native imagery of a child and use it for working material” 
(p. 103) and in relation to the creative subjects she followed this with “Touch 
the true voice of feeling and it will create its own power, its own pace, its own 
style, and its own vocabulary” (p. 221). Curiously, neither precept is par- 
ticularly startling now and neither was it then. 

Child-centered pedagogy had already made inroads under the New 
Zealand Education Department’s Dr. C.E. Beeby who promoted the experi- 
ments known as Britain’s “New Education” (Selleck, 1968) and America’s 
“Progressive Education” (pp. 72-73). He had introduced these ideas through 
radical administrative and curricular changes and the wide circulation of 
professional publications. Hood (1988) is correct when she notes that Ashton- 
Warner’s “insights came to the public at a time when the life spirit of the age 
was saying the same thing in a different way” but fails to develop the reasons 
for her subject’s peculiar contribution. This lack of broad educational per- 
spective weakens the biography although the omission is not really unex- 
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pected in that biography is the history of an individual’s past and only 
sometimes reflects collective experience. 

When Sylvia was formulating her views on the creation of a learning 
environment in the classroom as distinct from a teaching method (work that 
had begun in the 1930s only to be affirmed and put into practice much later), 
New Zealand elementary education no less than elsewhere was in the process 
of being transformed. Philosophers such as Dewey whose conceptualizations 
of “organic education” and the experiments at the Chicago Laboratory 
School were central to the general debate about education for democracies 
that resounded in the first decades of the 20th century. Cremin (1961) in The 
Transformation of the School comments that organic education’s raison 
d’étre was “to preserve the sincerity and unselfconsciousness of emotional 
life” (p. 149). The last decades of the 19th century had set the stage for such 
experimentation. Oppenheim had founded The Organic School of Education 
and had written The Development of the Child as early as 1898 whereas 
Dewey’s work on the “whole child,” Experience and Education (1938), was 
merely a restatement of earlier views found in School and Society (Dewey, 
1900) and The Child and the Curriculum (Dewey, 1902). Moreover, Hender- 
son experimented with the new ideas and methods at New York’s Pratt 
Institute in the first part of the 20th century with Johnson promoting these 
principles in her experimental classrooms while writing about the project in 
Youth in a World of Men (Johnson, 1929) and later, Thirty Years with an 
Idea (Johnson, 1939). While there is little indication that Sylvia—being 
contemptuous of anything American and not being in the least theoretically 
or intellectually curious—had examined any of these works, she did, however, 
acknowledge the influence of the British don Sir Herbert Read whose well 
known ideas culminated in Education Through Art (Read, 1943). Sir Herbert 
met her in 1963 and wrote the introduction to Spinster. His praise gave 
considerable credibility to her work in a country that greatly admired English 
academics. Nonetheless, while Sylvia may have disclaimed intimate know- 
ledge with many of the ideas of progressive education, she could hardly have 
failed to absorb them, as they were becoming increasingly integrated into the 
elementary schools of her country. Moreover she was surely influenced by 
those 19th-century educational ideas which were part of all Teachers’ College 
curricula in the 1920s. Such educational theorists as Pestalozzi (1894) and 
Froebel (1895) organized learning activities around ideas of the “organic 
nature” of learning and centered their pedagogies on the creative arts, object 
lessons, and sense data. Sylvia herself dallied with Rousseau’s (1979) ideas 
on “natural education” when she was in Auckland. A brief purview of these 
earlier educators illustrates how these ideas were already part of the kinder- 
garten and infant school movements in Britain and Europe when Sylvia 
Ashton-Warner was developing her own version. It was, however, her in- 
dividualistic style of teaching reading to non-English speaking minority 
children that revitalized such ideas. 

If organic education was not a new idea and given that Sylvia was not a 
particularly successful teacher, we can better understand the Education 
Department’s less than wholehearted response to her work. Arbitrary, given 
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to volatile altercations, contemptuous of authority figures yet alarmingly 
authoritarian herself, she was absent from the classroom for long periods of 
time when subordinates took over the routine and core subjects while she 
controlled the creative subjects. More surprisingly the children themselves 
were the ones who remained unimpressed by her schematic imitations of 
their artwork for their readers and newspapers. Apparently her motto “get 
down to their level” failed to inspire their enthusiasm (pp. 138-139). While 
Sylvia claimed that—and there remains further room for speculation with 
regard to the issue—the Department deliberately ignored and destroyed her 
reading materials, one might sympathize with the opinions of the bureau- 
crats in Auckland that to publish official readers from her organic materials 
with her dramatic illustrations would defeat the whole purpose of the organic 
method and the key vocabulary. Curiously, Sylvia failed to see the logic of this 
view. On the other hand it was a loss to the teaching profession that such 
materials were not published. Whatever the truth of the matter the Educa- 
tion Department was careless in the extreme with materials Sylvia had 
worked on so lovingly for years and the exchanges between them only served 
to exacerbate her sense of persecution. 

It is scarcely surprising that it was during the 1960s and early 1970s when 
North Americans, on hearing about her work, visited her school and finally 
claimed her as a visionary. This was a time of intellectual ferment, bold 
educational innovation, curricular expansion, and astonishing growth in 
“alternate” forms of schooling which rejected common wisdom about sequen- 
tially graded learning processes, and when liberal and radical critics alike 
attacked mass schooling. The pool of public monies seemed bottomless; 
expansion facilities could not balance teacher supply and demand, and the 
counter culture was at its most militant. The reexamination of orthodox 
forms of schooling was fuelled by prosperity, protest, and optimism. “De- 
schoolers” and “alternate schoolers” flourished: Bereiter at OISE (Bereiter, 
1973); Holt (1964); Illich (1971); Reimer (1971); and Postman (Postman & 
Weingartner, 1969) to name only a few. Inquiries into education were 
producing documents such as Ontario’s Hall-Dennis Report (Ontario, 1968) 
which shaped much of the transformation of the schools in Canada and 
Alberta (Alberta, 1972) as did the Plowden Report (Great Britain, 1967) in 
Great Britain. The Sputnik factor contributed to a ruthless critique of school- 
ing in a democratic society in the United States. 

Not since the debates surrounding pre-World War I progressive education 
and education for social reconstruction in the 1930s was there so much 
controversy about the nature, purpose, and direction of schooling in North 
America. Sylvia Ashton-Warner symbolized this growing discontent and 
seemed to articulate the projections and frustrations of many classroom 
teachers in her idiosyncratic manner and dramatic teaching style. She was 
recognized during a period when teachers were reasserting a professionalism 
they felt had been eroded and at a time when economic prosperity allowed 
them to challenge the hierarchical structures which, they felt, repressed their 
autonomy and independence. The relevance of principals, inspectors, and 
traditional pedagogies were in doubt. Therefore, the most ardent of her 
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devotees were teachers who felt misunderstood by “the system”; those who 
felt that their creativity was constrained by existing structures; those who 
taught racial minorities and saw her method as a “quick fix”; and those who 
opposed basal readers and core curricula on principle in their enthusiasm for 
spontaneity and liberated emotions. These decades were throbbing with such 
malcontents whose protestations were directed at centralized and bureau- 
cratized school systems and whose faith in education as a major social change 
agent was unbounded. 

In fact many of Ashton-Warner’s admirers quite misunderstood her, al- 
though ultimately it was in North America that the dissonance between the 
views she espoused about liberated pedagogies and her own classroom style 
became apparent. Liberated they may have been but only if orchestrated by 
the artist-teacher in conformity to her will. Hood carefully compiles evidence 
that throughout Ashton-Warner’s career there had been an overwhelming 
craving for adulation and a tendency to exhibitionism. In the classroom she 
always had to be the star and this had been the inspiration behind her work. 
While her often ferocious psyche was transmuted in the creative arts it was 
“Sylvia’s own creativity rather than the children’s that reigned supreme” 
(Hood,, 1988, pp. 1384-135). The children expressed her vision rather than 
their own. 

I would argue, therefore, that the transferability of her ideas remained 
problematic—something the Education Department recognized when she 
initially approached it and when its inspectors first discovered her. A method 
might be translated to other classrooms—although as anyone who works 
with curriculum understands even this cannot be guaranteed. But to other 
regions, countries, and cultures? That is quite another matter. But more to 
the point Sylvia’s work was not really a method at all; rather it was a 
teaching style. Without Sylvia it foundered. It was Sylvia who provided the 
charisma and Sylvia alone. Buber (Cohen, 1983), a philosopher and teacher, 
is a not dissimilar case as is the case of Neill at Summerhill (Neill, 1960). The 
disciples never equalled their leaders. 

That Sylvia’s admirers at Aspen community school in Colorado became 
her detractors was not unexpected. They had invited her to administer the 
curriculum, demonstrate her methods, and offer expertise and guidance to 
their utopian educational vision. But in all respects she was a disaster. In this 
instance there was no bureaucratic obstructionism. A little homework on the 
part of those educators who invited her to Aspen might have avoided consid- 
erable disillusionment and no little expense. For example, they might have 
found out that while developing the key vocabulary for Maori youngsters hers 
was a strictly “pakeha” (European) emphasis even as she deplored the 
“Department’s” ethnocentric reading materials. Neither did she see the 
Maori children’s stories as supplementing departmental texts but as leading 
the way to their integration into the dominant culture (Hood, 1988, p. 137). 

It seems too that her interpretation of what constituted experience was 
always at the emotional and personal rather than cultural level and in this 
respect she departed from Dewey and many other progressive educators. 
While descriptions of beatings, drunkenness, hatreds, all colorfully peppered 
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with obscenities were “experiential,” it was not quite to the taste of tradition- 
al teachers. While admitting that organic learning was therapeutic they 
rather doubted that it was at the same time educational. However, her Aspen 
colleagues were less interested in this philosophical problem than they were 
in another matter. Sylvia needed unquestioning conformity from the child- 
ren even as her rhetoric spoke of nonconformity; and more, she needed 
“obedience and attentiveness for her method to succeed, and she had, in the 
past, used the strap, the stick, the ruler, or her bare hand to ensure full 
co-operation” (Hood, 1988, p. 204). In American schools in the 1970s this was 
barely acceptable and at Aspen it was not acceptable at all. As for school 
meetings which debated philosophies and teaching methods, for Sylvia “these 
were a waste of time, and they confirmed her longheld conviction that 
Americans were undisciplined, garrulous, and arrogant” (p. 203). Yet the 
school meeting was the focal point around which Aspen revolved. The fact 
that she brought her prejudices about American culture with her did not 
escape them although they could not have anticipated her vitriolic and 
blistering contempt when she wrote her exposé of “the American character” 
in Spearpoint (Ashton-Warner, 1972), an unrestrained and inaccurate ac- 
count of the Aspen episode. 

I recall the tenor of some of the reviews I read of Spearpoint; how 
conservative reviewers delighted in the exaggerations and misinformation 
contained between its pages. Here, after all, was a celebrated radical educa- 
tor whose work with underprivileged children of a racial minority from 
another country had been eminently successful but who was unable to work 
with middle-class American children. The failure could not be hers but a 
consequence of the degeneracy of American society, the permissiveness of its 
childrearing practices, the disintegrations of family life, the shoddiness of 
progressive education, the disorder of the “new left,” and the excesses of 
individualism. At one and the same time it was a curious situation. Sylvia 
was being hailed as an advocate of radical and innovative education. She was, 
ironically, mistaken as being “on the left” by the noisy critics of capitalism 
when in fact she despised the “masses” as “monstrosities of drooling in- 
sanity” (Hood, 1988, p. 131). Sylvia was both artiste and elitist and praised 
by educational and political conservatives who agreed with her judgments 
about the American “character.” Spearpoint cannot be seen as an apologia 
for her own failures. Sylvia did not admit failure. Rather it was a frontal 
attack to keep critics at bay. However, this did not daunt her from submitting 
her personal, literary, and education papers to Boston University so as to 
publicly reject the National Library of New Zealand’s appeals to house them 
(p. 185). 

Having tolerated her incompetence, alcoholism, egoism, and contempt, 
the Aspen community felt itself well rid of her when she rejected an invita- 
tion to develop peace textbooks at the University of Jerusalem Peace School 
in favor of an invitation to take up a visiting fellowship at Simon Fraser. A 
full professorship at Simon Fraser seemed an unlikely honor for she had few 
academic abilities and little interest really in teaching others her methods. 
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Ashton-Warner’s time in Canada, while not untroubled, was comparative- 
ly productive. For two years she conducted intimate off-campus workshops 
and while she was thought to be eccentric and arbitrary her six-week courses 
called “Mornings with Young Children” were conducted two mornings a 
week. They simulated a classroom with students required to regress to a 
five-year-old level. This was something she had been able to do all her life 
(Hood suggests she was psychologically “fixated”) and was a crucial means 
by which she touched the psyches of children so immediately. A two-year 
experiment in organic education, the Vancouver Project, which grew out of 
the workshops was taken into elementary classrooms by nine primary teach- 
ers. While not judged as memorable, by its conclusion Professor Ashton- 
Warner had returned to her native country. Though not remarked on by the 
biographer, Ashton-Warner’s presence at Simon Fraser was less improbable 
than first impressions suggest. This new Canadian university was the em- 
bodiment of an “organic” vision of education in its relationship between 
informal and formal modes of learning and the connections between univer- 
sity and community. This was evident in its choice of instructors who not 
only emphasized interdisciplinary programs but also combined the practical 
with the theoretical. Even its choice of architectural design combined the 
aesthetic vision with a bold functionality of open interior spatial arrange- 
ments. It was a campus where the student body was militant and the 
presence of a “radical” on the Education Faculty satisfied their demands for 
instructors who did not represent the traditional academic norms. Her 
presence, therefore, was consistent with the institution’s avant-garde philos- 
ophy. 

That some discomfort arises from reading Hood’s (1988) portrait of Syl- 
via Ashton-Warner has already been alluded to. Her portrait, so incisive on 
many counts, nevertheless fails to place Sylvia in a broader sociohistorical 
context in its use of bold interpretive strokes which psychologizes its subject. 
This resort to the psychological explanation is disappointing in light of 
feminist scholarship which points to the systemic discrimination of female 
pathbreakers in all the professions including the “feminized” ones such as 
teaching where men manage and women teach. To avoid the seductions of 
what might be summarized as “the revelations of a personality disorder” we 
must not forget either the history of female marginalization or the distor- 
tions of female personality formation wherever there is endemic sexism and 
whenever women are compelled to respond and react to cultural biases that 
Stereotype female behavior. Sylvia Ashton-Warner was hardly the first 
woman, nor will she be the last, to manipulate this climate, resort to deviant 
strategies, and even depend on alcohol in what amounted to gender-specific 
responses to sustain a fragile personality and her “art” at the same time. 

While it is true that Sylvia’s personality and temperament were neither 
easily contained within nor constrained by the parameters of a male-defined 
educational system, her life and the biographical genre itself are trivialized 
by Hood’s attempt to minimize the significance of Sylvia’s own description of 
her social order—“THE PERMANENT SOLID BLOCK OF MALE EDUCA- 
TIONAL HOSTILITY (p. 158)”—especially given Hood’s astute observation 
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about Sylvia’s obsessive demands for male adulation because it was men 
“who controlled the status heap” (p. 94). None of this can be fully ap- 
prehended without placing Sylvia in a patriarchal society whose power 
defined her as a woman, bestowed credibility on her reputation and work, 
and chose whether or not to concede the recognition she so desperately 
craved; and a society where married women were second-class citizens (p. 39) 
and spinsters, though independent, were stigmatized. Sylvia’s dilemma is 
common to her gender and known by all its members. “No matter which way 
I wriggled and dodged, I couldn’t get clear of teaching. All of which narrowed 
the crises down to a clear-cut choice: love or freedom,” she stated (p. 64). 

To be fair to Hood (1988) she does not set out to write a feminist biog- 
raphy or, for that matter, a biography about a feminist (Rooke & Schnell, 
1989). Sylvia may have been a strong-minded, individualistic woman, but she 
did not claim to be feminist at either a conscious or ideological level. This 
does not mean, however, that she could not have been subject of a feminist 
biography as distinct from a biography about a singular female. To my mind 
at least, not placing her into the broader framework of feminist theory or 
using gender as an analytical tool leaves Sylvia standing alone—a sport of 
history—whereas had she been so placed in a gender context she could have 
offered insights with regard to women of her generation and society in a 
more general sense. Feminist discourse opens new insights and interpreta- 
tions, whereas psychologizing the historical actor at times reduces her to a 
pathology. It mechanistically uses a popular shorthand that bypasses the 
complexities of societal factors so that human personality is seen solely in 
terms of familial influences and emotional elements. 

This reliance on the psychological paradigm exemplifies 20th-century 
biography. Nineteenth-century biographies concentrated on the view of his- 
tory as that of great men, whereas under the Freudian shadow many histori- 
cal worthies have been “put on the couch” so to speak. This biographical 
approach has not diminished since Strachey’s (1918) pioneering efforts when 
he analyzed historical figures such as Florence Nightingale and Cardinal 
Manning in his popular Eminent Victorians. However, it is in Hood’s psycho- 
logical explanations that the biography is tenuous on occasion. Although the 
dominant teacher-mother (whose abusive teaching style Sylvia internalized) 
and the much adored but ineffectual father provide an archetypal backdrop 
in which the central tensions and paradoxes of her subsequent life are acted 
out compulsively and in the long run, creatively, this psychodrama is not 
altogether convincing. For example, much is made of her choice of teaching 
as a profession and is seen as an “identification with the aggressor” (her 
mother). Fearing her mother’s power, it is suggested, Sylvia assumes the 
teacher persona to control that power by taking it on herself. 

Lortie’s Schoolteacher (1975) offers less convoluted possibilities for the 
predominance of females in the teaching profession and why they continue to 
choose it. Moreover, these operated more forcefully earlier in the century 
than they do now when career options have increased for women. These 
include familiarity with the classroom and the security represented by this 
ongoing familiarity; the psychological reinforcements of the school environ- 
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ment for bright students whose identity was dependent on academic achieve- 
ment; the influence of one or more strong-willed female role models; the 
comfortableness of relating to children which to some is easier than to adults 
and equals; the need to be in control and exercise unquestioned authority; 
and finally, perhaps even most importantly, the fact that teaching and family 
life (i.e., motherhood) are compatible in terms of time, holidays, and emotion- 
al involvement (especially in relation to young children). All of these pertain 
to Sylvia but particularly the last given that she could not imagine life 
without marriage and motherhood. She had seen her own mother under 
great stress continue to teach. Instead she found a partner who would take 
over the bulk of domestic and parenting duties. But do we need to go any 
further than to comprehend how few career choices there were for women in 
New Zealand in the 1920s? At this time career choices were both gender-re- 
stricted and gender-specific—nursing, teaching, and secretarial work being 
the common alternatives. One cannot suppose that either the constraints of 
gender or of social class did not operate any less in Sylvia’s case. 

However, even the traumas of childhood and the relationships between 
the girl-child and her parents might have been illumined by feminist analyses 
of object-relations and gender formation in families, particularly in light of 
the mother-daughter dynamics expounded by Chodorow in The Reproduc- 
tion of Mothering (1978). This would have been more useful than the trans- 
ference of the Electra complex into classical psychological metaphors and 
primordial reenactments with Mother and the Almighty (Hood, 1988, p. 
124). 

It is when one considers feminist approaches to biographical materials 
that the parallels in apparently different experiences can be illumined and 
contrasted. For instance, the similarities between Hood’s subject and my own 
biographical subject are more striking than otherwise despite the one being a 
New Zealander and the other a Canadian. Of course, both were the products 
of Anglophile nations with strong colonial connections and elitist sen- 
sibilities. No Bleeding Heart: Charlotte Whitton a Feminist on the Right 
(Rooke, 1987) argues its case around two ideas which are pertinent to Sylvia: 
the concept of “the woman-identified-woman” as distinct from “the man- 
identified woman”; and the centrality of marital status in the lives of earlier 
generations of professional women. Both were gendered artifacts of a 
homosexist culture that divided and conquered women by the coercive psy- 
chology of “real status” whose symbol was summed up in the question, “Has 
She got a boy?” (p. 50). 

Ashton-Warner and Whitton retained an intense emotional identification 
with their own sex.” If would appear that Whitton by living in close com- 
panionship for 30 years with a woman and associating in a feminized occupa- 
tion (social welfare) with women who were both colleagues and friends would 
represent more obviously the “woman-identified-woman” than the married 
Sylvia. Yet this was not unequivocally the case. Like Whitton, Sylvia before 
marriage was influenced by strong celibate women who epitomized the Vic- 
torian and Edwardian ideal of “chastity as power” in a time when marriage 
and career were rarely possible (Rooke & Schnell, 1985). In those instances 
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where they were possible, career advancement for women was impeded by 
marriage. Because of the tensions created by the compromise involved in 
marriage Sylvia was unable to diffuse her fascination with “spinsters or her 
erotically intense but genetically forbidden relationships with her own sex.” 
These attachments were necessary because they provided her with emotional 
and professional sustenance which Keith Henderson for all his supportive- 
ness could not supply enough. 

Both women exhibited that “romantic” sensibility of same-sex love which 
had inspired and supported women before Freudian psychosexual theories 
eroded the former support systems and human relations were redefined 
under an aggressive heterosocial and heterosexual hegemony. Faderman, 
Surpassing the Love of Men (1981), is just one historian who has recorded 
this. The new climate, which impeded the progress of women in the public 
sphere while accommodating to changing economic realities and voting pat- 
terns, increased the male-identification of wives where all emotional and 
romantic needs were to be met in the ideal of “the couple.” This vociferous 
psychologizing and “normalizing” of human relations effectively neutralized 
women whether they remained single by accident or by choice or whether 
they married and were expected to forsake career altogether or subordinate 
it to the male “breadwinner” role. The differences between the accommoda- 
tions or the resistance are less important to women’s collective past experi- 
ence than the male-created boundaries which defined women as a sex. 

In Sylvia’s case, her continued fascination with spinster-professionals— 
“these unmarried, free, untrammelled people whose lives belong to no man 
and who dedicate themselves to their work” (p. 53) echoed a commonplace 
sensibility from the previous century. Whitton, at various points in her own 
careers, articulated identical sentiments to those Sylvia expressed about her 
profession. “I wouldn’t mind putting it to you,” Sylvia wrote in her autobiog- 
raphy, “that senior women teachers should be required to be celibate, like 
priests, and that’s a good analogy” (p. 66). Family life dissipated one’s 
energies and interfered with total immersion in one’s art. Sylvia struggled 
for the duration of her marriage with a tension experienced by many artistic, 
capable, ambitious, and independent married women. She answered to the 
sensual and emotional needs of her own nature which required marriage to 
legitimate her by providing definition, support, security, and credibility. It 
seems logically the case, therefore, that if “senior” female teachers ought to 
remain celibate, then married female teachers, like herself, ought to remain 
subordinates in the school system. As a married woman she could remain in 
teaching because she worked in remote two-room schoolhouses as the subor- 
dinate partner in a teaching “couple.” 

Further, the male-ordered world that legitimated women as married and 
mothers, or stigmatized them as single and nonmothers, punished both 
Whitton and Ashton-Warner for their nonconformity as women. Both 
reacted to the prevailing gender-role assigned them in exhibitionism, aggres- 
sion, the demand for recognition, frustration, hostility, bullying, a sense of 
persecution (not altogether unjustified), and passionate outbursts. Their 
particular responses are not as significant as the fact that these represented 
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those binary oppositions that the female gender role prescribes: the one did 
not submit to the circumscribed nature of the social construction in the 
interest of career; the other was fraught with ambivalence about relegating 
her own needs to a secondary status. Whitton chose singleness, which in 
those days meant celibacy, but her female companion assumed a similar role 
as Keith Henderson did in Sylvia’s life. This represented an adaptation of 
marital gender roles whereas Sylvia reversed marital norms. Both women 
operated in a phallocentric ethos as if they were males with their partners 
assuming the conventional “female” gender roles. There are indeed many 
similarities in two such unlikely lives. 

At 23 years of age Sylvia probably forced a marriage to “a man called 
Keith Henderson, a wedding ring on my finger to show the girls, and a white 
wedding dress with a veil” (p. 60). She confessed that ambition, autonomy, 
and even art itself “vaporised before the white picture,” a grim reminder that 
in Sylvia’s marriage-compulsion and Whitton’s marriage-resistance men are 
not required to make the same choices, that men’s careers are not curtailed 
if they marry. Neither are their identities atrophied. Nowhere has this dilem- 
ma been so eloquently and passionately pursued than by Virginia Woolf in 
1928 when her fictional heroine, Orlando, cries out, “Life! A Lover! Not Life! 
A Husband” (Woolf, 1928, p. 152) At the same time she is overcome by an 
inexplicable tumult as she observes that “wedding rings abound.... Wedding 
rings were everywhere ... gold, or pinchback, thin, thick, plain, smooth” (p. 
151). She is obsessed by the sight of couples “indissolubly linked together.” 
Her ruminations 


were accompanied by such a tingling and twangling of the afflicted finger that 
she could scarcely keep her ideas in order.... Though the seat of her trouble 
seemed to be the left hand, she could feel herself poisoned through and 
through, and was forced at length to consider the most desperate of remedies, 
which was to yield completely and submissively to the spirit of the age, and 
take a husband. (p. 151) 


Sylvia is not a woman alone. She is no sport of history. She shares the 
crisis and paradox of being female in a male world. Fictional and real women 
share her dilemma whether they are represented by Kate Chopin’s suicidal 
heroine, Sylvia Plath’s longing for annihilation, or Woolf’s anorexia and 
madness, the meaninglessness of Emma Bovary’s domesticity, the adul- 
terous passion of Anna Karenina, the ordinary suburban housewife’s depres- 
sions, Kate Millet’s dependence on valium, or the door that was slammed 
shut by Ibsen’s Nora. These—to name only a few—expose something deeper 
in women’s restlessness than androcentric psychological constructs would 
have us believe. If Sylvia’s personality was disordered, if Sylvia was both 
poseur and imposter, if Sylvia was egocentric, vain, possessive, and demand- 
ing—then she shares a “pathology” that has inspired powerful fiction and 
some of the saddest and wasted lives ever lived by less heroic women. 
Psychology, in the final analysis, does not deal effectively with the existential 
paradox of women’s experience, and this is where an analysis of the pressures 
of heterosexism, patriarchy, and misogyny can regenerate traditional ap- 
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proaches to biography and add to rather than detract from present know- 
ledge. 

Hood asks of Sylvia’s life, “How could she reconcile the man-loving 
woman she wanted to be with the woman-loving artist who possessed her?” 
She could not—any more than any of Greer’s (1970) contingent of “female 
eunuchs” are able to. On the other hand, if marriage for Sylvia was the stage 
on which she acted out a crisis between “love and freedom,” it was also a 
theater she manipulated with consummate artistry so that it did not suppress 
her creativity. Indeed it facilitated her work. Ironically it was marriage that 
freed her even as it was resistance to it that freed Whitton. 

Nevertheless, some might consider her choice a conventional and even 
timid one. I am, for instance, reminded of similar ambiguity and tension 
contained in Dorothy Sayers’ (1936) novels and so rigorously analyzed by 
Leonardi in Dangerous by Degrees: Women at Oxford and the Somerville 
College Novelists (1989). Sayers’ and Sylvia’s writings are explosive with a 
sexual tension so close to the surface it is barely concealed in their trans- 
muted and acceptable heterosexual adaptations, suppression of darker emo- 
tions, and denial of the positive aspects of same-sex sociosexual experience. 
Both women identified poignantly and desperately with that heterosexist 
climate that shaped their craving for acceptance. Both women’s struggles 
with their contradictions and conformities resulted in bursts of creativity and 
inspiration. Despite evidence of an imaginative sublimation, it is not possible 
entirely to agree with the assertion that when Sylvia married “she carved out 
a freedom that was probably greater than any other lifestyle could ever have 
provided” (p. 64). A freedom it may have been, but one of great emotional 
cost and ultimately personal loss. In the first instance alone, dreams of Paris 
and travel were compressed and whittled down to an isolated and rude 
parochialism that was best represented in the comment, “to hear it whis- 
pered of someone we knew, ‘They’ve gone Maori teaching’ was to hear their 
doom and register the professional stigma on us all” (p. 73). Sylvia did not 
travel until Keith died and she was 61. Meanwhile, Keith “countered the 
eroding helplessness of his Sylvia-dominated existence” (p. 86). It was in 
some ways a bitter freedom. 

To Hood, a perceptive and sensitive biographer, the ongoing struggle 
between love and freedom is an archetypal one ultimately mirrored in the 
conflict between the artist and her milieu. To a feminist the ongoing struggle 
between love and freedom is an inevitable and irreconcilable psychic conflict 
exemplified by Sylvia as a woman exiled and alienated from herself in a 
patriarchal social order that tried to make her in its own image. In this 
context we can add to Hood’s germane and intelligent account and open up 
new ways of answering the question I found myself asking throughout— 
“Who is Sylvia?” 
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